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Suggested Further Reading 

1. Quantum Processes, Systems, and Information, by Schumacher and Westmore- 
land, Cambridge University Press (2010). This is an excellent book, and should be 
your first choice for additional material. It has everything up to relativistic quantum 
mechanics. 

2. Quantum Information and Quantum Computation, by Nielsen and Chuang, 
Cambridge University Press (2000). This is the current standard work on quan- 
tum information theory. It has a comprehensive introduction to quantum mechanics 
along the lines treated here, but in more depth. The book is from 2000, which means 
that several important recent topics are not covered. 

3. Quantum Theory of Condensed Matter, by John Chalker, Oxford University lec- 
ture notes (2012). These lecture notes give an in-depth treatment of non -relativistic 
many-body problems in quantum mechanics. The notes are available at: 
http://www-thphys.physics.oxMCMk/people/JohnChalker/qtcm/lecture-notes.pdf 

4. Introductory Quantum Optics, by Gerry and Knight, Cambridge University Press 
(2005). This is a very accessible introduction to the quantum theory of light. 

5. Quantum Field Theory, by Lewis Ryder, Cambridge University Press (1996). This 
is a quite advanced introduction to relativistic quantum field theory. 



I would like to thank the students who have used these lecture notes in previous years 
and spotted typos or errors (notably Tom Bullock). Their efforts have greatly improved 
the readability of these notes. 
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1 Linear Vector Spaces and Hilbert Space 

The modern version of quantum mechanics was formulated in 1932 by John von Neu- 
mann in his famous book Mathematical Foundations of Quantum Mechanics, and it uni- 
fies Schrodingers wave theory with the matrix mechanics of Heisenberg, Born, and Jordan. 
The theory is framed in terms of linear vector spaces, so the first couple of lectures we have 
to remind ourselves of the relevant mathematics. 

1 .1 Linear vector spaces 

Consider a set of vectors, denoted by \y/), \(f>), etc., and the complex numbers a, b, c, etc. 
A linear vector space T is a mathematical structure of vectors and numbers that obeys the 
following rules: 

1. \y/) + \ (f>) = + \ y/) (commutativity), 

2. \xf/) + (\(f>) + \x)) = + |0>)+ \x) (associativity), 

3. a(\y) + \<p)) = a + a \<p) (linearity), 

4. (a + b)\xff) =a\\f/) + b\xff) (linearity), 

5. a(b\<p)) = (ab)\(p). 

There is also a null vector such that \y/) + = \y/), and for every \y/) there is a vector \(f>) 
such that \xf/) + \(p) = 0. 

For each vector |^) there is a dual vector (<f>\, and the set of dual vectors also form a 
linear vector space Y* . There is an inner product between vectors from Y and Y* denoted 
by {y/\(f>). The inner product has the following properties: 

1. <Vl0> = <W\ 

2. (xf/\xf/)>0, 

3. <i/^> = o |v> = 0, 

4. |^) = ci|i//i) + c 2 |^2> => (<PW) = c 1 ((p\-fi) + c 2 ((p\y2), 

5. ||(/)||= yj \(p\(p) is the norm of \<p). 

If || (p ||= 1, the vector \(p) is a unit vector. The set of unit vectors {e l<p with cp e [0,2n) 
form a so-called ray in the linear vector space. A linear vector space that has a norm || . || 
(there are many different ways we can define a norm) is called a Hilbert space. We will 
always assume that the linear vector spaces are Hilbert spaces. 

For linear vector spaces with an inner product we can derive the Cauchy-Schwarz 
inequality, also known as the Schwarz inequality: 



\{<p\y)\ 2 <{y\y){<p\<p). 



(1.1) 
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This is a very important relation, since it requires only the inner product structure. Re- 
lations that are based on this inequality, such as the Heisenberg uncertainty relation 
between observables, therefore have a very general validity. 

If two vectors have an inner product equal to zero, then these vectors are called orthog- 
onal. This is the definition of orthogonality. When these vectors are also unit vectors, they 
are called orthonormal. A set of vectors \<pi), \(f>2),- ■ ■ |</>iv) are linearly independent if 

E«y|^-> = (1.2) 

j 

implies that all aj = 0. The maximum number of linearly independent vectors in Y is 
the dimension of Y. Orthonormal vectors form a complete orthonormal basis for Y if any 
vector can be written as 

N 

\v)=L c k\<l>k), d.3) 
k=l 

and {(pj\<Pk) = Sjk- We can take the inner product of \y) with any of the basis vectors \(pj) 
to obtain 

N N 

((pjly) = £ c k {(pj\(pk) = £ c k 5j k = Cj. (1.4) 
k=i k=i 

Substitute this back into the expansion of \ifr), and we find 

N 

\v)=L\<l>k)(<l>k\V). (1.5) 
k=l 

Therefore Y.k must act like the identity. In fact, this gives us an important clue 

that operators of states must take the general form of sums over objects like \(p) {%\. 

1 .2 Operators in Hilbert space 

The objects \ f) are vectors in a Hilbert space. We can imagine applying rotations of the 
vectors, rescaling, permutations of vectors in a basis, and so on. These are described 
mathematically as operators, and we denote them by capital letters A, B, C, etc. In general 
we write 

A\<p) = \ V j), (1.6) 

for some \<p), \if/) eY. It is important to remember that operators act on all the vectors 
in Hilbert space. Let {\<pj)}j be an orthonormal basis. We can calculate the inner product 
between the vectors \<pj) and A \<Pk)'- 

((Pj\{A\(p k )) = {(Pj\A\(p k )=A jk . (1.7) 
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The two indices indicate that operators are matrices. 

As an example, consider two vectors, written as two-dimensional column vectors 

M = (J), M = (J), d.8) 

and suppose that 

<2 



A = 
We calculate 



[0 3 



(1.9) 



An = <0i|A|0i> = (l,O)-g 3)(J) = (1.0)-(q) = 2. (1.10) 

Similarly, we can calculate that A22 = 3, and A 12 = A21 = (check this). We therefore have 
that A \(pi) = 2 |0!> and A |0 2 ) = 3 \(p 2 ). 

Complex numbers a have complex conjugates a* and vectors \y/) have dual vectors 
((p\. Is there an equivalent for operators? The answer is yes, and it is called the adjoint, 
or Hermitian conjugate, and is denoted by a dagger t- The natural way to define it is 
according to the rule 

(x(,\A\ct>)* = {<p\At\y,), (1.11) 

for any \<p) and \y/). In matrix notation, and given an orthonormal basis {\<pj)}j, this 
becomes 

{<P J \A\<p k y=A* jk = {<p k \A^\<p j )=Al j . (1.12) 

So the matrix representation of the adjoint A^ is the transpose and the complex conjugate 
of the matrix A, as given by (A*)jk = A* k j. The adjoint has the following properties: 

1. (cA)t = c*A\ 

2. (AB) t =S t A t > 

3. <|0»t = <0|. 

Note the order of the operators in 2: AB is generally not the same as BA. The difference 
between the two is called the commutator, denoted by 

[A,B]=AB-BA. (1.13) 

For example, we can choose 



A 



1^ , „ (1 



U 0; and s = [o -1 ■ <114) 
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which leads to 



IA,B] 



1H1 

,1 o, 

-2 

2 





(0 -1 
^0. 



1 

lo -1 



1 

U o 



-1 
,1 o 



' 1 
[-1 



(1.15) 



Many, but not all, operators have an inverse. Let A \(f>) - \y/) and B\y/) = \(p). Then we 
have 



BA\(p) = \<p) and AB\y/) = \y/). 



(1.16) 



If Eq. (1.16) holds true for all |(/>) and thenfi is the inverse of A, and we write B = A -1 . 
An operator that has an inverse is called invertible. Another important property that an 
operator may possess is positivity. An operator is positive if 



(0|A|0)>O for all |0> 



(1.17) 



We also write this as A > 0. 

From the matrix representation of operators you can easily see that the operators 
themselves form a linear vector space: 



1. A+B=B+A, 



2. A + (B + C) = (A+B) + C, 

3. a(A+B) = aA + aB, 



4. (a + b)A = aA + bA, 



5. a(bA) = (ab)A. 



We can also define the operator norm || A || according to 




(1.18) 



which means that the linear vector space of operators is again a Hilbert space. The symbol 
Tr(.) denotes the trace of an operator, and we will return to this special operator property 
later in this section. 

Every operator has a set of vectors for which 



(1.19) 
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This is called the eigenvalue equation (or eigenequation) for A, and the vectors \aj) are 
the eigenvectors. The complex numbers aj are eigenvalues. In the basis of eigenvectors, 
the matrix representation of A becomes 





a 2 



A 




(1.20) 



\0 ••• a N j 

When some of the a/s are the same, we speak of degenerate eigenvalues. When there are re 
identical eigenvalues, we have re-fold degeneracy. The eigenvectors corresponding to this 
eigenvalue then span an re-dimensional subspace of the vector space. We will return to 
subspaces shortly, when we introduce projection operators. 
For any orthonormal basis {| (f>j)}j we have 

(cf>j\A\(f> k )=A jk , (1.21) 

which can be written in the form 



A = Y J A jk \<p j ){<p k \. (1.22) 

jk 

For the special case where \(f)j) = \aj) this reduces to 

A = L°j\°jK°j\- d-23) 

j 

This is the spectral decomposition of A. When all aj are equal, we have complete degener- 
acy over the full vector space, and the operator becomes proportional (up to a factor aj) to 
the identity D. Note that this is independent of the basis {|a/)}. As a consequence, for any 
orthonormal basis {| </>/)} we have 

l = Z>y>tol- (1.24) 

j 

This is the completeness relation, and we will use this many times in our calculations. 

Lemma: If two non-degenerate operators commute ([A,B] = 0), then they have a common 
set of eigenvectors. 

Proof: Let A = Y.k a k \ a k) ( a k I an d B = Y.jk Bjk \ a j) ( a k I- We can choose this without loss of 
generality: we write both operators in the eigenbasis of A. Furthermore, [A,B] = 
implies that AB = BA. 

AB=Y. a kBi m \a k ) (a k \ai) (a m \ = ^ a i B im \a>i) (ami 

klm Im 

BA = £ a k B im \ai) (a m \a k ) (a k \ = Y, a mBlm l«z> (a m \ (1.25) 

klm Im 
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Therefore 

)B lm \ai)(a m \ = 0. (1.26) 

lm 

If a/ ^ a m for I ^ m, then = 0, and Bi m oc <5; m . Therefore {\aj}} is an eigenbasis 
for B. □ 

The proof of the converse is left as an exercise. It turns out that this is also true when A 
and/or B are degenerate. 

1.3 Hermitian and unitary operators 

Next, we will consider two special types of operators, namely Hermitian and unitary oper- 
ators. An operator A is Hermitian if and only if A^ = A. 

Lemma: An operator is Hermitian if and only if it has real eigenvalues: A* = A o aj £ U. 
Proof: The eigenvalue equation of A implies that 

M a j) = a A a j) => M^ = a*j(aj\, (1.27) 



which means that (aj\ A \aj) = aj and (a,j\ A' \aj) = a*. It is now straightforward to 

(°j\A\ a j) = ( a j\ A *\ a j)' ( L28 > 



show that A=A j < implies aj = a*, or aj e R. Conversely, aj e U implies aj = a*, and 



Let \y) = LkCk Then 

<y| A |yr> = £ \cj\ 2 (aj\ A \aj) = £ |c/ (aj \ A^ \ aj ) = £ |c/ <o,| A* | B</ ) 
y j i 

= <^|A t |^) (1.29) 

for all \ and therefore A = A^. □ 

Next, we construct the exponent of an operator A according to U = exp(icA). We have 
included the complex number c for completeness. At first sight, you may wonder what it 
means to take the exponent of an operator. Recall, however, that the exponent has a power 
expansion: 

00 (ic) n 

U = exp(icA) = V —-A n . (1.30) 
rc= n - 

The « th power of an operator is straightforward: just multiply A n times with itself. The 
expression in Eq. (1.30) is then well denned, and the exponent is taken as an abbreviation 
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of the power expansion. In general, we can construct any function of operators, as long as 
we can define the function in terms of a power expansion: 

oo 

f(A)=Y.fnA n . (1.31) 

This can also be extended to functions of multiple operators, but now we have to be very 
careful in the case where these operators do not commute. Familiar rules for combining 
normal functions no longer apply (see exercise 4b). 

We can construct the adjoint of the operator U according to 

* = ^pA") = £ t!^L A U = expMc*At). (1.32) 

In the special case where A=A^ and c is real, we calculate 

UU* = exp(icA)exp(-ic*A t ) = exp(icA)exp(-icA) = exp[ic(A - A)] = D, (1.33) 

since A commutes with itself. Similarly, U'U = 0. Therefore, U* = U , and an operator 
with this property is called unitary. Each unitary operator can be generated by a Hermi- 
tian (self-adjoint) operator A and a real number c. A is called the generator oft/. We often 
write U = Ua(c). Unitary operators are basis transformations. 

1 .4 Projection operators and tensor products 

We can combine two linear vector spaces °l£ and Y into a new linear vector space TV = °lt ®V . 
The symbol © is called the direct sum. The dimension of TV is the sum of the dimensions 
of^ and Y: 

dim^ = dim^ + dimy. (1.34) 

A vector in TV can be written as 

l^>3r=k)* + |0)y. d-35) 

where \yj)<% and \(p) r are typically not normalized (i.e., they are not unit vectors). The 
spaces °U and Y are so-called subspaces ofW. 

As an example, consider the three-dimensional Euclidean space spanned by the Carte- 
sian axes x, y, and z. The xy-plane is a two-dimensional subspace of the full space, and 
the 2-axis is a one-dimensional subspace. Any three-dimensional form can be projected 
onto the xy-plane by setting the z component to zero. Similarly, we can project onto the 
2-axis by setting the x and y coordinates to zero. A projector is therefore associated with 
a subspace. It acts on a vector in the full space, and forces all components to zero, except 
those of the subspace it projects onto. 
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The formal definition of a projector P% on °U is given by 

Pvmr=\Y) ete - d-36) 

This is equivalent to requiring that P^, = P^, or P^r is idempotent. One-dimensional 
projectors can be written as 

Pj=Wj){<Pj\- d-37) 

Two projectors Pi and P2 are orthogonal is P1P2 = 0. If P1P2 = 0, then Pi +P 2 is another 
projector: 

(Pi +P 2 ) 2 = P? + P1P2 +P 2 Pi +P| = Pi +Pf =Pi +P 2 • d-38) 

When Pi and P2 commute but are non-orthogonal (i.e., they overlap), the general projector 
onto their combined subspace is 

Pi +2 =Pi+P 2 -PiP 2 . (1.39) 

(Prove this.) The orthocomplement of P is D -P, which is also a projector: 

P(D-P) = P-P 2 =P-P = and (0 - P) 2 = - 2P + P 2 = - P . (1.40) 

Another way to combine two vector spaces °U and Y is via the tensor product: TV = 
®Y, where the symbol ® is called the direct product or tensor product. The dimension 
of the space TV is then 

dimr = dim<^-dim7. (1.41) 

Let \yr) e °U and \<p) e 7. Then 

\y/)®\(f>)eW = ^®Y. (1.42) 

If \ f) = L/O/|V0') and \<p) = Y.jbj\(pj), then the tensor product of these vectors can be 
written as 

k) ® |0> = Y. a J h k \Vj) ®\<Pk) = Y. a J h k \Vj) \<t>k) = E a A \Vj,<l>k) , d-43) 
jk jk jk 

where we introduced convenient abbreviations for the tensor product notation. The inner 
product of two vectors that are tensor products is 

((i^i|®(0i|)(k 2 )® |02)) = <VllV2><0ll02>. (1-44) 
Operators also obey the tensor product structure, with 

(A »B)\yr) ® |0> = (A |^>) ® (B \<p)), (1.45) 

and 

(A®B)(C®D)\i(/) ® \(p) = (AC \y/)) ® (BD (1.46) 
General rules for tensor products of operators are 
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1. A®0 = 0®B = 0, 

2. ® = 0, 

3. (Ai+A 2 )®fi = Ai®B + A 2 ®B, 

4. aA®bB = (ab)A®B, 

5. (A®Br 1 =A- 1 ®B~ 1 , 

6. (A®5) t =A t ®B t . 

Note that the last rule preserves the order of the operators. In other words, operators 
always act on their own space. Often, it is understood implicitly which operator acts on 
which subspace, and we will write A ® D = A and D ®B - B. Alternatively, we can add 
subscripts to the operator, e.g., A<% and By. 

As a practical example, consider two two-dimensional operators 

A = (f A A and H = g" B A 

with respect to some orthonormal bases {|ai> , |a2>} and {|6i> , I&2)} for A and B, respectively 
(not necessarily eigenbases). The question is now: what is the matrix representation of 
A ®B? Since the dimension of the new vector space is the product of the dimensions of the 
two vector spaces, we have dimW = 2-2 = 4. A natural basis for A ® B is then given by 
{\aj,b k )}jk, with j,k = 1,2, or 

|oi>|6i>, |ai>|6 2 >, la 2 >|6i>, |a 2 > \b 2 ) . (1.48) 

We can construct the matrix representation of A ®B by applying this operator to the basis 
vectors in Eq. (1.48), using 

A\a j )=A 1 j\a 1 )+A 2 j\a 2 ) and B \a k ) =B lk |6i) +B 2k \b 2 ) , d-49) 

which leads to 

A®B\a 1 ,b 1 ) = (An lai> + A 2 i |a 2 »(Bn +B 2 i |6 2 » 
A®B\a 1 ,b 2 ) = (An \a 1 )+A 2 i \a 2 ))(B 12 \b x )+B 22 \b 2 )) 
A®B\a 2 ,bi) = (A 12 |ai) + A 22 |a 2 >)(fin \bi)+B 21 \b 2 )) 

A®B\a 2 ,b 2 ) = (A is |ai> + A 22 |a 2 »(Bi 2 l&i) +^22 \b 2 )) (1.50) 

Looking at the first line of Eq. (1.50), the basis vector |ai,6i) gets mapped to all basis 
vectors: 



A ®B |ai,6i> = An^ii |ai,6i> + Anfi 2 i \a\,b 2 ) + A 21 B 1X |a 2 ,&i> + A 2 i£ 2 i la 2 ,&2> ■ (1.51) 
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Combining this into matrix form leads to 



A®B = 



-Aii-Bii A11B12 Ai 2 Bii A12B12 

AuB 2 i AuB 2 2 A 12 B 2 i A 12 B 2 2 

A 2 iBn A21B12 A22-B11 A22B12 

A21B21 A21B22 A22-B21 A22-B22 y 



A n B A 12 B\ 
[A 21 B A 22 fiJ' 



(1.52) 



Recall that this is dependent on the basis that we have chosen. In particular, A < 
be diagonalized in some other basis. 



'B may 



1 .5 The trace and determinant of an operator 

There are two special functions of operators that play a key role in the theory of linear 
vector spaces. They are the trace and the determinant of an operator, denoted by Tr(A) and 
det(A), respectively. While the trace and determinant are most conveniently evaluated in 
matrix representation, they are independent of the chosen basis. 

When we defined the norm of an operator, we introduced the trace. It is evaluated by 
adding the diagonal elements of the matrix representation of the operator: 

Tr(A) = £> 7 |A|0;), (1.53) 

j 

where {\<pj)}j is any orthonormal basis. This independence means that the trace is an 
invariant property of the operator. Moreover, the trace has the following important prop- 
erties: 



1. If A = A\ then Tr(A) is real, 

2. Tr(aA) = aTr(A), 

3. Tr(A+5) = Tr(A) + TrtB), 

4. Tr(Afi) = Tr(fiA) (the "cyclic property"). 

The first property follows immediately when we evaluate the trace in the diagonal basis, 
where it becomes a sum over real eigenvalues. The second and third properties convey 
the linearity of the trace. The fourth property is extremely useful, and can be shown as 
follows: 

Tr( AB ) = £ < <Pj I AB 1 0; > = £ < cf>j | A | Vk > < yr k | B | <Pj > 

j Jk 

= L{n\B\4>j)(4>j\A\yr k ) = L{n\BA\yf k ) 
jk k 

= Tr(BA) . (1.54) 
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This derivation also demonstrates the usefulness of inserting a resolution of the identity 
in strategic places. In the cyclic property, the operators A and B may be products of two 
operators, which then leads to 

Tr(ABC) = Tr(BCA) = Tr(CAB) . (1.55) 

Any cyclic (even) permutation of operators under a trace gives rise to the same value of 
the trace as the original operator ordering. 

Finally, we construct the partial trace of an operator that lives on a tensor product 
space. Suppose that A ® B is an operator in the Hilbert space J€\ ® J€ 2 . We can trace out 
Hilbert space J€\, denoted by Tri(.): 

Tri(A®5) = Tr(A)5, or equivalently Tri(Aifi 2 ) = Tr(Ai)£ 2 • (1.56) 

Taking the partial trace has the effect of removing the entire Hilbert space J€\ from the 
description. It reduces the total vector space. The partial trace always carries an index, 
which determines which space is traced over. 
The determinant of a 2 x 2 matrix is given by 

det(A) = det|^ ^=A n A 22 -A 12 A 21 . (1.57) 

The determinant of higher-dimensional matrices can be defined recursively as follows: 
The top-left element of an n x n matrix defines an (n - 1) x (n - 1) matrix by removing the 
top row and the left column. Similarly, any other element in the left column defines an 
(re - 1) x (n- 1) matrix by removing the left column and the row of the element we chose. The 
determinant of the n x n matrix is then given by the top-left element times the determinant 
of the remaining (n - 1) x (n - 1) matrix, minus the product of the second element down in 
the left column and the remaining (n - 1) x (n - 1) matrix, plus the third element times the 
remaining matrix, etc. 

The determinant of the product of matrices is equal to the product of the determinants 
of the matrices: 

det(A5) = det(A)detCB). (1.58) 

Moreover, if A is an invertible matrix, then we have 

det(A- 1 ) = det(Ar 1 . (1.59) 

This leads to an important relation between similar matrices A = X~ 1 BX: 

det(A) = det(X _1 5X) = det(X _1 )det(B)det(X) = det(X)- 1 det(B)det(X) = det(B). (1.60) 

In particular, this means that the determinant is independent of the basis in which the 
matrix is written, which means that it is an intrinsic property of the operator associated 
with that matrix. 

Finally, here's a fun relation between the trace and the determinant of an operator: 



det[exp(A)] = exp[Tr(A)] . 



(1.61) 
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Exercises 

1. Vectors and matrices: 

(a) Are the following three vectors linearly dependent or independent: a = (2, 3, - 1), 
b = (0,1,2), and c = (0, 0,-5)? 

(b) Consider the vectors \y/) = 3i \(pi) - li \(f>2) and \%) = \<Pi) + 2 \<p2), with {\(f>i)} an 
orthonormal basis. Calculate the inner product between \ifr) and \%), and show 
that they satisfy the Cauchy-Schwarz inequality 

(c) Consider the two matrices 



(1.62) 





f° 


i 


2) 




12 i 


^ 


A = 





1 





and B = 


3 1 


5 




U 





oj 




,0 -i 


-v 



Calculate A 1 B and BA . Are they equal? 
Hint: find the inverse of A in the diagonal basis. 

(0 1 



(d) Calculate A ® B and B ® A, where A 



1 



and B 



'1 








2. Operators: 

(a) Which of these operators are Hermitian: A+A\ i(A+A f ), i(A-A f ), and AU? 

(b) Prove that a shared eigenbasis for two operators A and B implies that [A,fi] = 
0. 

(c) Let U be a transformation matrix that maps one complete orthonormal basis to 
another. Show that U is unitary. 

(d) How many real parameters completely determine ad x d unitary matrix? 

3. Properties of the trace and the determinant: 

(a) Calculate the trace and the determinant of the matrices A and B in exercise lc. 

(b) Show that the expectation value of A can be written as Tr(|i//) A). 

(c) Prove that the trace is independent of the basis. 

4. Commutator identities. 

(a) Let F(t) = e At e Bt . Calculate dF/dt and use [e At ,B] = (e At Be~ At -B)e At to sim- 
plify your result. 

(b) Let Git) = e At+Bt+ f (t)H . Show by calculating dG/dt, and setting dF/dt = dGldt 
at t = 1, that the following operator identity 



„AB 

e e 



e A+B+\[A,B} 



(1.63) 
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holds if A and B both commute with [A, B]. Hint: use the Hadamard lemma 

t t 2 

e At Be -At = B + -[A,B] + -[A, [A,B]] + ... (1.64) 

(c) Show that the commutator of two Hermitian operators is anti-Hermitian (A^ = 
-A). 

(d) Prove the commutator analog of the Jacobi identity 

[A,[B,C]] + [B,[C,A]] + [C,[A,5]] = 0. (1.65) 
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2 The Postulates of Quantum Mechanics 

The entire structure of quantum mechanics (including its relativistic extension) can be 
formulated in terms of states and operations in Hilbert space. We need rules that map the 
physical quantities such as states, observables, and measurements to the mathematical 
structure of vector spaces, vectors and operators. There are several ways in which this can 
be done, and here we summarize these rules in terms of five postulates. 

Postulate 1: A physical system is described by a Hilbert space J€, and the state of the 
system is represented by a ray with norm 1 in J€. 

There are a number of important aspects to this postulate. First, the fact that states are 
rays, rather than vectors means that an overall phase e l<p of the state does not have any 
physically observable consequences, and e l(p \ifr) represents the same state as \ Second, 
the state contains all information about the system. In particular, there are no hidden 
variables in this standard formulation of quantum mechanics. Finally, the dimension of J€ 
may be infinite, which is the case, for example, when J€ is the space of square-integrable 
functions. 

As an example of this postulate, consider a two-level quantum system (a qubit). This 
system can be described by two orthonormal states |0) and |1>. Due to linearity of Hilbert 
space, the superposition a|0) + /3|1) is again a state of the system if it has norm 1, or 



This is called the superposition principle: any normalised superposition of valid quantum 
states is again a valid quantum state. It is a direct consequence of the linearity of the 
vector space, and as we shall see later, this principle has some bizarre consequences that 
have been corroborated in many experiments. 

Postulate 2: Every physical observable A corresponds to a self-adjoint (Hermitian 1 ) op- 
erator A whose eigenvectors form a complete basis. 

We use a hat to distinguish between the observable and the operator, but usually this 
distinction is not necessary. In these notes, we will use hats only when there is a danger 
of confusion. 

As an example, take the operator X: 



This operator can be interpreted as a bit flip of a qubit. In matrix notation the state vectors 
can be written as 



(a*<0| + j8*<l|)(a|0> + /3|l>) = l or |a| 2 + |^| 2 = l. 



(2.1) 



Z|0) = |1) and X|1> = |0>. 



(2.2) 




and |1) 



t 







1 



(2.3) 



1 In Hilbert spaces of infinite dimensionality, there are subtle differences between self-adjoint and Hermi- 
tian operators. We ignore these subtleties here, because we will be mostly dealing with finite-dimensional 
spaces. 
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i -l? l\ (2.4) 



which means that X is written as 

Y = i 

[1 

with eigenvalues ±1. The eigenstates of X are 

l±>=^>. (2.5) 
These states form an orthonormal basis. 

Postulate 3: The eigenvalues of A are the possible measurement outcomes, and the prob- 
ability of finding the outcome aj in a measurement is given by the Born rule: 

p(aj)=\(aj\f)\ 2 , (2.6) 

where \y/) is the state of the system, and \aj) is the eigenvector associated with the 
eigenvalue aj via A |ay) = aj \aj). If ay is m-fold degenerate, then 

m 

.«)|..A|2 



p(a j ) = Y l \<a7\V)r, (2-7) 
l=i 



where the 



span the m-fold degenerate subspace. 



The expectation value of A with respect to the state of the system \y/) is denoted by (A), 
and evaluated as 

(A) = (y,\A\y,) = (y,\ j^aj \aj) (aj\j \y,) = ^piaj^j. (2.8) 

This is the weighted average of the measurement outcomes. The spread of the measure- 
ment outcomes (or the uncertainty) is given by the variance 

(AA) 2 = <(A - <A» 2 > = <A 2 > - <A> 2 . (2.9) 

So far we mainly dealt with discrete systems on finite-dimensional Hilbert spaces. But 
what about continuous systems, such as a particle in a box, or a harmonic oscillator? We 
can still write the spectral decomposition of an operator A but the sum must be replace by 
an integral: 

A = Jdaf A (a)\a){a\, (2.10) 

where \a) is an eigenstate of A. Typically, there are problems with the normalization of 
\a), which is related to the impossibility of preparing a system in exactly the state \a). We 
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will not explore these subtleties further in this course, but you should be aware that they 
exist. The expectation value of A is 

(A) = (y/\A\y/) = J daf A (a)(y\a)(a\y) = J da f A (a)\y(a)\ 2 , (2.11) 

where we defined the wave function y/(a) = (a\y/), and \y/(a)\ 2 is properly interpreted as the 
probability density that you remember from second-year quantum mechanics. 

The probability of finding the eigenvalue of an operator A in the interval a and a + da 
given the state is 

(tl/\(\a){a\da)\y/) = dp(a), (2.12) 
since both sides must be infinitesimal. We therefore find that 

^ = l#)l 2 . (2.13) 
da 

Postulate 4: The dynamics of quantum systems is governed by unitary transformations. 

We can write the state of a system at time t as \i(/(t)), and at some time to < t as \y/(to)). 
The fourth postulate tells us that there is a unitary operator t/(Mo) that transforms the 
state at time to to the state at time t: 

\yf(t)) = U(t,to)\v(t )). (2.14) 

Since the evolution from time t to t is denoted by U (t, t) and must be equal to the identity, 
we deduce that U depends only on time differences: U (t, to) = U(t - to), and U(0) = I. 
As an example, let U(t) be generated by a Hermitian operator A according to 

E7(f) = exp|-iAfJ. (2.15) 

The argument of the exponential must be dimensionless, so A must be proportional to H 
times an angular frequency (in other words, an energy). Suppose that \y/(t)) is the state of 
a qubit, and that A = hwX. If |i^(0)) = |0) we want to calculate the state of the system at 
time t. We can write 



00 (-iu)t) n 

\y(t)) = U(t)\y/(0)) = exp(-iajtX)\0)= £ -—X n . (2.16) 

71 = U - 



Observe that X 2 = D, so we can separate the power series into even and odd values of re: 

00 (-ia)t) 2n 00 (-io)t) 2n+1 
M*))=L , r, 10) +E ,o im ^IO> = cos(ft>t)IO>-tsin(ft)t)|l>. (2.17) 

n =0 ( Zn )- n=0 K^n + 1)\ 



In other words, the state oscillates between |0> and |1>. 
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The fourth postulate also leads to the Schrodinger equation. Let's take the infinitesi- 
mal form of Eq. (2.14): 

\y(t + dt)) = U(dt)\y(t)). (2.18) 
We require that U(dt) is generated by some Hermitian operator H: 



U(dt) = exp 



i 



-Hdt\. (2.19) 

ft 



H must have the dimensions of energy, so we identify it with the energy operator, or the 
Hamiltonian. We can now take a Taylor expansion of \if/(t + dt)) to first order in dt: 

d 

\y(t + dt)) = \y(t)) + dt—\y(t)) + ..., (2.20) 
and we expand the unitary operator to first order in dt as well: 

U{dt) = l- l -Hdt + ... (2.21) 
ft 



We combine this into 



\y/(t)) + dt^- \iff(t)) = f 1 - ^Hdt) \y(t)) , (2.22) 
dt \ H i 

which can be recast into the Schrodinger equation: 

ih— \ifr(t))=H\y/(t)). (2.23) 
Therefore, the Schrodinger equation follows directly from the postulates! 

Postulate 5: If a measurement of an observable A yields an eigenvalue aj, then immedi- 
ately after the measurement, the system is in the eigenstate \aj) corresponding to 
the eigenvalue. 

This is the infamous projection postulate, so named because a measurement "projects" 
the system to the eigenstate corresponding to the measured value. This postulate has as 
observable consequence that a second measurement immediately after the first will also 
find the outcome aj. Each measurement outcome aj corresponds to a projection operator 
Pj on the subspace spanned by the eigenvector(s) belonging to aj. A (perfect) measurement 
can be described by applying a projector to the state, and renormalize: 



This also works for degenerate eigenvalues. 
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We have established earlier that the expectation value of A can be written as a trace: 

(A)=Tr(\y/)(y\A). (2.25) 

Now instead of the full operator A, we calculate the trace of Pj = \aj) (aj\: 

(Pj) =Tr(|^)<^|P J -) = Tr(|^)<^|a i )<a 7 -|)= |<a,|0| 2 =p(aj). (2.26) 

So we can calculate the probability of a measurement outcome by taking the expectation 
value of the projection operator that corresponds to the eigenstate of the measurement 
outcome. This is one of the basic calculations in quantum mechanics that you should be 
able to do. 

The projection postulate is somewhat problematic for the interpretation of quantum 
mechanics, because it leads to the so-called measurement problem: Why does a measure- 
ment induce a non-unitary evolution of the system? After all, the measurement apparatus 
can also be described quantum mechanically 2 and then the system plus the measurement 
apparatus evolves unitarily. But then we must invoke a new device that measures the 
combined system and measurement apparatus. However, this in turn can be described 
quantum mechanically, and so on. 

On the other hand, we do see definite measurement outcomes when we do experiments, 
so at some level the projection postulate is necessary, and somewhere there must be a 
"collapse of the wave function". Schrodinger already struggled with this question, and 
came up with his famous thought experiment about a cat in a box with a poison-filled vial 
attached to a Geiger counter monitoring a radioactive atom (see Fig. 1). When the atom 
decays, it will trigger the Geiger counter, which in turn causes the release of the poison 
killing the cat. When we do not look inside the box (more precisely: when no information 
about the atom-counter-vial-cat system escapes from the box), the entire system is in a 
quantum superposition. However, when we open the box, we do find the cat either dead 

2 This is something most people require from a fundamental theory: quantum mechanics should not just 
break down for macroscopic objects. Indeed, experimental evidence of macroscopic superpositions has been 
found in the form of "cat states". 
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or alive. One solution of the problem seems to be that the quantum state represents our 
knowledge of the system, and that looking inside the box merely updates our information 
about the atom, counter, vial and the cat. So nothing "collapses" except our own state of 
mind. 

However, this cannot be the entire story, because quantum mechanics clearly is not just 
about our opinions of cats and decaying atoms. In particular, if we prepare an electron in 
a spin "up" state 1 1>, then whenever we measure the spin along the 2-direction we will find 
the measurement outcome "up", no matter what we think about electrons and quantum 
mechanics. So there seems to be some physical property associated with the electron that 
determines the measurement outcome and is described by the quantum state. 

Various interpretations of quantum mechanics attempt to address these (and other) 
issues. The original interpretation of quantum mechanics was mainly put forward by 
Niels Bohr, and is called the Copenhagen interpretation. Broadly speaking, it says that 
the quantum state is a convenient fiction, used to calculate the results of measurement 
outcomes, and that the system cannot be considered separate from the measurement 
apparatus. Alternatively, there are interpretations of quantum mechanics, such as the 
Ghirardi-Rimini-Weber interpretation, that do ascribe some kind of reality to the state 
of the system, in which case a physical mechanism for the collapse of the wave function 
must be given. Many of these interpretations can be classified as hidden variable the- 
ories, which postulate that there is a deeper physical reality described by some "hidden 
variables" that we must average over. This in turn explains the probabilistic nature of 
quantum mechanics. The problem with such theories is that these hidden variables must 
be quite weird: they can change instantly depending on events lightyears away 3 , thus vi- 
olating Einstein's theory of special relativity. Many physicists do not like this aspect of 
hidden variable theories. 

Alternatively, quantum mechanics can be interpreted in terms of "many worlds": the 
Many Worlds interpretation states that there is one state vector for the entire universe, 
and that each measurement splits the universe into different branches corresponding to 
the different measurement outcomes. It is attractive since it seems to be a philosophically 
consistent interpretation, and while it has been acquiring a growing number of supporters 
over recent years 4 , a lot of physicists have a deep aversion to the idea of parallel universes. 

Finally, there is the epistemic interpretation, which is very similar to the Copenhagen 
interpretation in that it treats the quantum state to a large extent as a measure of our 
knowledge of the quantum system (and the measurement apparatus). At the same time, it 
denies a deeper underlying reality (i.e., no hidden variables). The attractive feature of this 
interpretation is that it requires a minimal amount of fuss, and fits naturally with current 
research in quantum information theory. The downside is that you have to abandon simple 
scientific realism that allows you to talk about the properties of electrons and photons, and 
many physicists are not prepared to do that. 

3 . . . even though the averaging over the hidden variables means you can never signal faster than light. 
4 There seems to be some evidence that the Many Worlds interpretation fits well with the latest cosmo- 
logical models based on string theory 
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As you can see, quantum mechanics forces us to abandon some deeply held (classical) 
convictions about Nature. Depending on your preference, you may be drawn to one or 
other interpretation. It is currently not know which interpretation is the correct one. 



Exercises 

1. Calculate the eigenvalues and the eigenstates of the bit flip operator X, and show 
that the eigenstates form an orthonormal basis. Calculate the expectation value of 
Xfor |^) = l/ v / 3|0) + iv / 273|l). 

2. Show that the variance of A vanishes when is an eigenstate of A. 

3. Prove that an operator is Hermitian if and only if it has real eigenvalues. 

4. Show that a qubit in an unknown state \y/) cannot be copied. This is the no-cloning 
theorem. Hint: start with a state for some initial state and require that 
for \y/) = |0) and \y/) = |1> the cloning procedure is a unitary transformation |0) |i> — ► 
|0>|0> and |1> |i> ^ |1> |1>. 

5. The uncertainty principle. 

(a) Use the Cauchy-Schwarz inequality to derive the following relation between 
non-commuting observables A and B: 

(AA) 2 (AS) 2 >7l<[A,S]>| 2 . (2.27) 
4 

Hint: define \ f) = (A - (A))\y/) and \g) = i(B - (B))\y/), and use that \(f\g)\ > 
l\(f\g) + (g\f)\. 

(b) Show that this reduces to Heisenberg's uncertainty relation when A and B are 
canonically conjugate observables, for example position and momentum. 

(c) Does this method work for deriving the uncertainty principle between energy 
and time? 



6. Consider the Hamiltonian H and the state \y/) given by 



H = E 



i 0\ 
-i 
^0 0-1) 



and \y) = 



fl-i] 

1-i 

1 I 



(2.28) 



where E is a constant with dimensions of energy. Calculate the energy eigenvalues 
and the expectation value of the Hamiltonian. 

Show that the momentum and the total energy can be measured simultaneously only 
when the potential is constant everywhere. What does a constant potential mean in 
terms of the dynamics of a particle? 
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3 Schrodinger and Heisenberg Pictures 

So far we have assumed that the quantum states \tff(t)) describing the system carry the 
time dependence. However, this is not the only way to keep track of the time evolution. 
Since all physically observed quantities are expectation values, we can write 

(A) = Tr [|^(*)> (y/(t)\ A] = Tr \u(t) \y/(0)) <^(0)| U\t)A 



Tr 



|^(0)) <^(0)| Z7 t (*)AZ7(^)| (cyclic property) 



;Tr[|i/A(0))(v(0)|A(*)], 



(3.1) 



where we defined the time-varying operator A(t) = U^(t)AU(t). Clearly, we can keep track 
of the time evolution in the operatorsl 

• Schrodinger picture: Keep track of the time evolution in the states, 

• Heisenberg picture: Keep track of the time evolution in the operators. 

We can label the states and operators "S" and "if" depending on the picture. For example, 

\Vh) = \Vs(0)) and A H (t) = uHt)A s U(t). (3.2) 

The time evolution for states is given by the Schrodinger equation, so we want a corre- 
sponding "Heisenberg equation" for the operators. First, we observe that 



U(t) = exp 



Ht 



such that 



(3.3) 



-U(t) 



HU(t). 



dt h 
Next, we calculate the time derivative of (A): 

—Tr[\xfr s (t)){y/s(t)\As] = ^(Vs(t)\ A s \ws(t)) = ^-{VH\A H it)\VH) ■ 

The last equation follows from Eq. (3.1). We can now calculate 
d 



dt 



(y/ s (t)\A s \y/ s (t)) 



(y/ s (0)\ [uHt)A s U(t) + uHt)A s U(t) + uHt)A s U(t)] \y/ s (0)) 

dA H (tY 



i 



l -HA H (t)- l -A H (t)H + 
n n 



(VH\lA H (t),H]\y/ H ) + (y/ H 



= (ysH 



dA H (t) 
dt 



\Vh). 



dt 

dA H (t) 
dt 



(3.6) 



\Vh) 
\Vh) 



(3.4) 



(3.5) 



d_ 
dt 



(y/ s (0)\uHt)A s U(t 
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Since this must be true for all \i^h), this is an operator identity: 

dA H (t) i vA dA H (t) 

t: =--[A H (t),H]+ ■ (3.7) 

This is the Heisenberg equation. Note the difference between the "straight d" and the 
"curly 3" in the time derivative and the partial time derivative, respectively. The partial 
derivative deals only with the explicit time dependence of the operator. In many cases 
(such as position and momentum) this is zero. 

We have seen that both the Schrodinger and the Heisenberg equation follows from Von 
Neumann's Hilbert space formalism of quantum mechanics. Consequently, we have proved 
that this formalism properly unifies both Schrodingers wave mechanics, and Heisenberg, 
Born, and Jordans matrix mechanics. 

As an example, consider a qubit with time evolution determined by the Hamiltonian 
(1 \ 

H = T^hcoZ, with Z = q A This may be a spin in a magnetic field, for example, such 

that co = -eB/mc. We want to calculate the time evolution of the operator Xu(t). Since we 
work in the Heisenberg picture alone, we will omit the subscript H. First, we evaluate the 
commutator in the Heisenberg equation 

ldX 1 co 
ih 2^ = 2 lX ' m = -"'2 Y - (3 ' 8) 



(0 -i 
where we defined Y = . 

i 



So now we must know the time evolution of Y as well: 

These are two coupled linear equations, which are relatively easy to solve: 

X = -coY and Y = coX and Z = 0. (3.10) 
We can define two new operators S+ =X ±iY, and obtain 

S+ = -coY±icoX = ±iwS+. (3.11) 
Solving these two equations yields S+(t) = S+(0)e ±ia)t , and this leads to 

S+(t) + S-(t) S + (0)e itot + S-(0)e- i(Ot 



X(t) 



2 2 

: - \x(0)e iwt + iY(0)e ia,t + X(0)e~ iwt - iY(0)e~ iwt 
X(0)cos(cot)-Y(0)sin(cot). (3.12) 



You are asked to show that Y(t) = Y(0)cos(cot) + X(0)sin(cot) in exercise 3. 
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We now take \vh) = |0> and X(0) = ^ Jj, Y(0) = K ^j. The expectation value of 
X(t) is then readily calculated to be 

<0| X(t) |0> = cos(w£) (0| X(0) |0) - sin(wf) <0| Y(0) |0> = . (3. 13) 

Alternatively, when = l+), we find 

(+\X(t)\+) = cos((ot) and (+|Y(£)|+) = sin(wf). (3.14) 
This is a circular motion in time: 



< 

I-) 


a 

► — 


11+) 


< 


► — 





The eigenstate of X(n/2) is point a, and the eigenstate of X(—n/2) is point 6. Furthermore, 
X(±n/2) = +Y(0), and the states at point a and 6 are therefore the eigenstates of Y: 

10) -ill) , , . |0> + i|l> 



and |t^ 6 > 



(3.15) 



y/2 V2 
A natural question to ask is where the states |0) and |1) fit in this picture. These are 
the eigenstates of the operator Z, which we used to generate the unitary time evolution. 
Clearly the states on the circle never become either |0) or |1), so we need to add another 
dimension: 




This is called the Bloch sphere, and operators are represented by straight lines through 
the origin. The axis of rotation for the straight lines that rotate with time is determined by 
the Hamiltonian. In the above case the Hamiltonian was proportional to Z, which means 
that the straight lines rotate around the axis through the eigensates of Z, which are |0) 
and |1). 
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Exercises 

1. Show that for the Hamiltonian Hs = Hh- 

2. The harmonic oscillator has the energy eigenvalue equation H\n) = hoj(n + ^) \n). 

(a) The classical solution of the harmonic oscillator is given by 

| a> = e -2l«l J^^\n), (3.16) 

in the limit of |a| » 1. Show that \a) is a properly normalized state for any 
aeC. 

(b) Calculate the time-evolved state \a(t)). 

(c) We introduce the ladder operators d\n) = \/n\n - 1) and a) \n) = Vn + l\n+ 1). 
Show that the number operator defined by n \n) = n \n) can be written as n = 

(d) Write the coherent state \ a) as a superposition of ladder operators acting on the 
ground state |0). 

(e) Note that the ground state is time-independent ({/(£) |0) = |0>). Calculate the 
time evolution of the ladder operators. 

(f) Calculate the position q = (d + a?)/2 and momentum p = -i(d- d?)l2 of the har- 
monic oscillator in the Heisenberg picture. Can you identify the classical har- 
monic motion? 

3. Let A be an operator given by A = aoD + a x X + a y Y + a z Z. Calculate the matrix A(t) 
given the Hamiltonian H = \hoiZ, and show that A is Hermitian when the are 
real. 

4. The interaction picture. 

(a) Let the Hamiltonian of a system be given by H = H + V, with Ho = p 2 /2m. 
Using \ f(t))j = Ul(t)\y/(t)) s with Uo(t) = exp(-iH t/h), calculate the time de- 
pendence of an operator in the interaction picture Ai(t). 

(b) Defining Hjit) = Ul(t)VU (t), show that 

iH Tt M*)>j (3-17) 

Is Hi identical to Hh and 

5. The time operator in quantum mechanics. 

(a) Let H \\f/) = E \ifr) , and assume the existence of a time operator conjugate to H, 
i.e., [H,T] = ih. Show that 

He iaT \y/) = (E-h(D)e iaT \y/) . (3.18) 
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(b) Given that (DeU, calculate the spectrum of if. 

(c) The energy of a system must be bounded from below in order to avoid infinite 
decay to ever lower energy states. What does this mean for T? 

6. Consider a three-level atom with two (degenerate) low-lying states |0) and |1> with 
zero energy, and a high level \e) (the "excited" state) with energy hco. The low levels 
are coupled to the excited level by optical fields D.ocosw t and Qicoswi^, respec- 
tively. 

(a) Give the (time-dependent) Hamiltonian H for the system. 

(b) The time dependence in H is difficult to deal with, so we must transform to the 
rotating frame via some unitary transformation U(t). Show that 

H' = U(t)HUHt) -mu . 

at 

You can use the Schrodinger equation with = U* 

(c) Calculate H' if U(t) is given by 



U(t) 



(1 

g-i(too-wi)i o 
e -i<»ot 



Why can we ignore the remaining time dependence in H'l This is called the 
Rotating Wave Approximation. 

(d) Calculate the A = eigenstate of if' in the case where co = (0\. 

(e) Design a way to bring the atom from the state |0) to |1) without ever populating 
the state \e). 
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4 Mixed States and the Density Operator 

So far we have considered states as vectors in Hilbert space that, according to the first pos- 
tulate, contain all the information about the system. In reality, however, we very rarely 
have complete information about a system. For example, the system may have interacted 
with its environment, which introduces some uncertainty in our knowledge of the state 
of the system. The question is therefore how we describe systems given incomplete in- 
formation. Much of contemporary research in quantum mechanics is about gaining full 
control over the quantum system (meaning to minimize the interaction with the environ- 
ment). This includes the field of quantum information and computation. The concept of 
incomplete information is therefore central to modern quantum mechanics. 



4.1 Mixed states 

First, we recall some properties of the trace: 

• Tr(aA) = aTr(A), 

• Tr(A+5) = Tr(A) + Tr(B). 

Also remember that we can write the expectation value of A as 

<A>=Tr(|^)(^|A), 



(4.1) 



where \y/) is the state of the system. It tells us everything there is to know about the 
system. But what if we don't know everything? 

As an example, consider that Alice prepares a qubit in the state |0) or in the state 
1+) = (|0> + \ l))/y/2 depending on the outcome of a balanced (50:50) coin toss. How does 
Bob describe the state before any measurement? First, we cannot say that the state is 
\ |0) + \ |+), because this is not normalized! 

The key to the solution is to observe that the expectation values must behave correctly. 
The expectation value (A) is the average of the eigenvalues of A for a given state. If the 
state is itself a statistical mixture (as in the example above), then the expectation values 
must also be averaged. So for the example, we require that for any A 



(A) = ^<A>o + ^<A> + = ^Tr(|0> <0| A)+ ^Tr(|+> <+| A) = Tr 
= Tr(pA), 



J|0> <0| + 
,z Z 



where we defined 



p = ^io> <oi + £i+x+i 



(4.2) 



(4.3) 



The statistical mixture is therefore properly described by an operator, rather than a simple 
vector. We can generalize this as 



P = Y,PkWk)(Vk\ , 
k 



(4.4) 
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where the pk are probabilities that sum up to one (Y.kPk = 1) and the \y/k) are normalized 
states (not necessarily complete or orthogonal). Since p acts as a weight, or a density, in 
the expectation value, we call it the density operator. We can diagonalize p to find the 
spectral decomposition 



p = L x j\W( x j\> 



(4.5) 



where forms a complete orthonormal basis, < Xj < 1, and T.jXj = 1. We can also 

show that p is a positive operator: 

(y/\ p \y/) = J^CjCk (A/| p \X k ) = JL c j c k E x l \M) <M\W 
jk jk l 

= Y J c*C} t Xi{Xj\Xi){Xi\X} t ) = Y J c*CkXi5ji6ik 

jkl jkl 

I 



>0. 



(4.6) 



In general, an operator p is a valid density operator if and only if it has the following three 
properties: 

1. p+ = p, 

2. Tr(p) = l, 

3. p>0. 

The density operator is a generalization of the state of a quantum system when we have 
incomplete information. In the special case where one of the pj = 1 and the others are 
zero, the density operator becomes the projector \y/j) (y/j\. In other words, it is completely 
determined by the state vector We call these pure states. The statistical mixture of 
pure states giving rise to the density operator is called a mixed state. 

The unitary evolution of the density operator can be derived directly from the Schro- 
dinger equation ihd t \y/) = H\y/): 



in L\ 

j 1 



dt 



Wj){yj\+pj 



[dt 



vo->|<voMvo->[^<vo- 



d 



= ih^+Hp 
dt 



pH 



= [H, P ] + ih^. 



(4.7) 



This agrees with the Heisenberg equation for operators, and it is sometimes known as 
the Von Neumann equation. In most problems the probabilities pj have no explicit time- 
dependence, and d t p = 0. 
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4.2 Decoherence 

The density operator allows us to consider the phenomenon of decoherence. Consider the 
pure state |+>. In matrix notation with respect to the basis {|+>,|->}, this can be written 
as 



1 




(4.8) 



The trace is 1, and one of the eigenvalues is 1, as required for a pure state. We can also 
write the density operator in the basis {|0> , |1>}: 



l+><+l = -p 
V2 



(4.9) 



Notice how the outer product (as opposed to the inner product) of two vectors creates a 
matrix representation of the corresponding projection operator. 
Let the time evolution of |+) be given by 



10) + |1) \0) + e mt \l) 



1+) 



V2 



V2 



The corresponding density operator becomes 



(4.10) 



1 i 

*o= 5 



iwt 



1 e 

-iuit 



(4.11) 



The "population" in the state |+) is given by the expectation value 

<+|p(f)|+> = \ + Jcos(w£). 



(4.12) 



This oscillation is due to the off-diagonal elements of p(t), and it is called the coherence 
of the system (see Fig. 2). The state is pure at any time t. In real physical systems the 
coherence often decays exponentially at a rate y, and the density matrix can be written as 



1 I 

*o= 5 



1 e ioit-jt 
g-icot-jt ! 



(4.13) 



The population in the state |+) decays accordingly as 

1 e~ rt cos(wt) 
<+lp(*)l+> = - + ■ 



(4.14) 



This is called decoherence of the system, and the value of y depends on the physical mech- 
anism that leads to the decoherence. 

The decoherence described above is just one particular type, and is called dephasing. 
Another important decoherence mechanism is relaxation to the ground state. If the state 
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time 



Figure 2: Population in the state |+> with decoherence (solid curve) and without (dashed 
curve). 



|1) has a larger energy than |0) there maybe processes such as spontaneous emission that 
drive the system to the ground state. Combining these two decay processes, we can write 
the density operator as 



2-e~ rit e io)t ~ T2t 
e -iwt-j 2 t e -yit 



(4.15) 



The study of decoherence is currently one of the most important research areas in quan- 
tum physics. 



4.3 Imperfect measurements 

Postulates 3 and 5 determine what happens when a quantum system is subjected to a 
measurement. In particular, these postulates concern ideal measurements. In practice, 
however, we often have to deal with imperfect measurements that include noise. As a 
simple example, consider a photodetector: not every photon that hits the detector will 
result in a detector "click", which tells us that there indeed was a photon. How can we 
describe situations like these? 

First, let's recap ideal measurements. We can ask the question what will be the out- 
come of a single measurement of the observable A. We know from Postulate 3 that the 
outcome m must be an eigenvalue a m of A. If the spectral decomposition of A is given by 

A = Y j a m \m)(m\, (4.16) 

m 

then the probability of finding measurement outcome m is given by the Born rule 



pirn) = \{mty)\ 2 = Tr(P m \y/) (yr\) = <P m > , 



(4.17) 
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where we introduced the operator P m = \m) (m\. One key interpretation of p(m) is as the 
expectation value of the operator P m associated with measurement outcome m. 

When a measurement does not destroy the system, the state of the system must be 
updated after the measurement to reflect the fact that another measurement of A imme- 
diately following the first will yield outcome m with certainty. The update rule is given by 
Eq. (2.24) 

W)- , ■ (4.18) 

\f{y\plPm\v) 

where the square root in the denominator is included to ensure proper normalization of 
the state after the measurement. However, this form is not so easily generalized to mea- 
surements yielding incomplete information, so instead we will write (using = P m = P^) 



i w i„t^ - (4 - 19) 



P m \ys)(ys\Pj 
Tr(P m |^)<^|P;) 

From a physical perspective this notation is preferable, since the unobservable global 
phase of (m\y/) no longer enters the description. Up to this point, both state preparation 
and measurement were assumed to be ideal. How must this formalism of ideal measure- 
ments be modified in order to take into account measurements that yield only partial in- 
formation about the system? First, we must find the probability for measurement outcome 
m, and second, we have to formulate the update rule for the state after the measurement. 

When we talk about imperfect measurements, we must have some way of knowing (or 
suspecting) how the measurement apparatus fails. For example, we may suspect that a 
photon hitting a photodetector has only a finite probability of triggering a detector "click". 
We therefore describe our measurement device with a general probability distribution 
{qj(m)}, where q j(m) is the probability that the measurement outcome m in the detector is 
triggered by a system in the state \ The probabilities q j(m) can be found by modelling 
the physical aspects of the measurement apparatus. The accuracy of this model can then 
be determined by experiment. The probability of measurement outcome m for the ideal 
case is given in Eq. (4.17) by (P m ). When the detector is not ideal, the probability of finding 
outcome m is given by the weighted average over all possible expectation values (Pj) that 
can lead to m: 

pirn) = Y^qj(m){Pj) = £>;(m)Tr(P 7 - \y) (y/\) 

j j 



Tr 



= Tr(E m \y)(y\) 



Y.Qj<m)Pj\^){i(/\ 

j 

= (E m ), (4.20) 
where we defined the operator E m associated with outcome m as 

E m = Y j qj(m)Pj. (4.21) 
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Each possible measurement outcome m has an associated operator E m , the expectation 
value of which is the probability of obtaining m in the measurement. The set of E m is 
called a Positive Operator-Valued Measure (POVM). While the total number of ideal mea- 
surement outcomes, modelled by P m , must be identical to the dimension of the Hilbert 
space (ignoring the technical complication of degeneracy), this is no longer the case for the 
POVM described by E m ; there can be more or fewer measurement outcomes, depending 
on the physical details of the measurement apparatus. For example, the measurement of 
an electron spin in a Stern-Gerlach apparatus can have outcomes "up", "down", or "failed 
measurement". The first two are determined by the position of the fluorescence spot on 
the screen, and the last may be the situation where the electron fails to produce a spot on 
the screen. Here, the number of possible measurement outcomes is greater than the num- 
ber of eigenstates of the spin operator. Similarly, photodetectors in Geiger mode have only 
two possible outcomes, namely a "click" or "no click" depending on whether the detector 
registered photons or not. However, the number of eigenstates of the intensity operator 
(the photon number states) is infinite. 

Similar to the density operator, the POVM elements E m have three key properties. 
First, the p(m) are probabilities and therefore real, so for all states \y/) we have 

(E m )* = (E m ) o (y/\E m \y/) = (y/\El\y/), (4.22) 

and E m is therefore Hermitian. Second, since I m p(m) = 1 we have 

£>(m) = £ (y,\E m \yr) = (yr\ J^E m \y,) = l, (4.23) 

mm m 

for all \y/), and therefore 

Y J E m = l (4.24) 

m 

where I is the identity operator. Finally, since p(m) = (y/\E m \iff) > for all \y/), the POVM 
element E m is a positive operator. Note the close analogy of the properties of the POVM 
and the density operator. In particular, just as in the case of density operators, POVMs are 
defined by these three properties. 

The second question about generalized measurements is how the measurement out- 
comes should be used to update the quantum state of the system. The rule for ideal von 
Neumann measurements is given in Eq. (4.19), which can be generalized immediately to 
density operators using the techniques presented above. This yields 

P mPPl (4 _ 25) 



Tr(P mP Pl) 



What if we have instead a measurement apparatus described by a POVM? Consistency 
with the Born rule in Eq. (4.20) requires that we again replace P m , associated with mea- 
surement outcome m, with a probability distribution over all Pj\ 

P/pPt P jP P\ 
P - E^«— f " 7T = E^SwiT^ ■ (4 " 26) 



j Tr 



LkQk(m)PkpPl 



Tr(£ m p)' 
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where we adjusted the normalization factor to maintain Tr(p) = 1. We also used that the 
POVM element E m in Eq. (4.21) can be written as 

E m = Y j q j (m)P]P j . (4.27) 



If we rescale the Pj by a factor y/gj(m), we obtain the standard form of the POVM 



where Aj m = ^/qJimjPj are the so-called the Kraus operators. Consequently, the update 
rule can be written as 

LjA jm pA] m 

9 Tr(E mP ) ' (4 " 29) 

which generally yields a mixed state (described by a density operator) after an incomplete 
measurement. 

Finally, we can consider the case of some nonunitary evolution without a measurement 
(e.g., when the system interacts with its environment). We can model this purely formally 
by removing the index of the measurement outcome m from the description (since there 
are no measurement outcomes). The most general evolution then takes the form 

p^p' = ^A kP Al, (4.30) 

k 

and the question is what form the Kraus operators take. We will return to this in 
section 6. 



Exercises 

1. The density matrix. 

(a) Show that 1 10> + % 1+) is n °t a properly normalized state. 

(b) Show that Tr(p) = 1 with p given by Eq. (4.3), and then prove that any density 
operator has unit trace and is Hermitian. 

(c) Show that density operators are convex, i.e., that p = w\p\+W2P2 with W1 + W2 = 
1 (i6>i,i6>2 ^ 0), and pi, p2 again density operators. 

(d) Calculate the expectation value of A using the two representations of p in terms 
of pi and the spectral decomposition. What is the difference in the physical 
interpretation of pj and Ay? 

2. Using the identity (x\A\y/) = Aif/(x), and the resolution of the identity / dx\x) (x\ = 
D, calculate the expectation value for an operator A, given a mixed state of wave 
functions. 



Section 4: Mixed States and the Density Operator 



37 



3. Calculate P 2 with P given by 



i 

P = 



a 



[a*,b*] 



'\a\ 2 ab* 
a*b 



and \af + \bf = l. (4.31) 



4. Calculate the eigenvalues of the density matrix in Eq. (4.15), and show that yi < 2j2- 
Hint: it is difficult to derive the inequality directly, so you should try to demonstrate 
that yi > 2j2 leads to a contradiction. 

5. A system with energy eigenstates \E n ) is in thermal equilibrium with a heat bath at 
temperature T. The probability for the system to be in state E n is proportional to 

e -E n /kT_ 

(a) Write the Hamiltonian of the system in terms of E n and \E n ). 

(b) Give the normalized density operator p for the system as a function of the 
Hamiltonian. 

(c) We identify the normalization of p with the partition function «2f . Calculate the 
average energy directly and via 

31n«Z 

{E} --mry (4 ' 32) 

of the system. 

(d) What is the entropy S = k In 2" if the system is a harmonic oscillator? Comment 
on the limit T — 0. 

6. Lossy photodetectors. 

(a) The state of a beam of light can be written in the photon number basis \n) 
as = T.n c n\n). What are the possible measurement outcomes for a perfect 
photon detector? Calculate the probabilities of the measurement outcome using 
projection operators. 

(b) Suppose that the detector can only tell the difference between the presence and 
absence of photons (a so-called "bucket detector"). How do we calculate the 
probability of finding the measurement outcomes? 

(c) Real bucket detectors have a finite efficiency rj, which means that each photon 
has a probability r\ of triggering the detector. Calculate the probabilities of the 
measurement outcomes. 

(d) What other possible imperfections do realistic bucket detectors have? 
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An electron with spin state \tf/) = a|t>+j8||> and |a| 2 + |/3| 2 = 1 is sent through a Stern- 
Gerlach apparatus to measure the spin in the z direction (i.e., the ||>, ||> basis). 

(a) What are the possible measurement outcomes? If the position of the electron 
is measured by an induction loop rather than a screen, what is the state of the 
electron immediately after the measurement? 

(b) Suppose that with probability p the induction loops fail to give a current sig- 
nifying the presence of an electron. What are the three possible measurement 
outcomes? Give the POVM. 

(c) Calculate the probabilities of the measurement outcomes, and the state of the 
electron imediately after the measurement (for all possible outcomes). 
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5 Composite Systems and Entanglement 

5.1 Composite systems 

Suppose we have two systems, described by Hilbert spaces J€\ and J(?<i, respectively We 
can choose orthonormal bases for each system: 

jei-{\<pi),\<p2),-..,\<pN)} and J?2:{\n),\V2),...,\yM)}. (5.1) 

The respective dimensions of J€\ and J€<i are N and M. We can construct N x M basis 
states for the composite system via \<pj) and \ipk)- This implies that the total Hilbert 
space of the composite system can be spanned by the tensor product 

{\(t>j)®\Vk)} jk on ^ 1+2 = ^i®^?2- (5.2) 
An arbitrary pure state on J?i+2 can be written as 

= L c Jk \<Pj) ® \n) = L c Jk \<i>j,n) ■ (5-3) 

jk jk 

For example, the system of two qubits can be written on the basis {|0,0> , |0, 1) , |1,0) , |1, 1)}. 
If system 1 is in state \<p) and system 2 is in state \i[r), the partial trace over system 2 
yields 

Tr 2 (|0,^><0,V|) = Tr 2 (|0><0| <S» |^> <^|) = 1 0> <^| Tr( 1 <^| ) = |0> <0| , (5.4) 

since the trace over any density operator is 1. We have now lost system 2 from our de- 
scription! Therefore, taking the partial trace without inserting any other operators is the 
mathematical version of forgetting about it. This is a very useful feature: you often do not 
want to deal with every possible system you are interested in. For example, if system 1 is 
a qubit, and system two is a very large environment the partial trace allows you to "trace 
out the environment". 

However, tracing out the environment will not always leave you with a pure state as 
in Eq. (5.4). If the system has interacted with the environment, taking the partial trace 
generally leaves you with a mixed state. This is due to entanglement between the system 
and its environment. 

5.2 Entanglement 

Consider the following experiment: Alice and Bob each blindly draw a marble from a vase 
that contains one black and one white marble. Let's call the state of the write marble |0) 
and the state of the black marble |1). If we describe this classical experiment quantum 
mechanically (we can always do this, because classical physics is contained in quantum 
physics), then there are two possible states, |1,0) and |0, 1). Since blind drawing is a 
statistical procedure, the state of the marbles held by Alice and Bob is the mixed state 



P = ^ |0,i) <o,i| + i |i,o>u,o|. 



(5.5) 
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From Alice's perspective, the state of her marble is obtained by tracing over Bob's marble: 

p A = Tr s (p) = - 10> <0| + 1 11> <1| . (5.6) 

This is what we expect: Alice has a 50:50 probability of finding "white" or "black" when 
she looks at her marble (i.e., when she measures the colour of the marble). 

Next, consider what the state of Bob's marble is when Alice finds a white marble. Just 
from the setup we know that Bob's marble must be black, because there was only one 
white and one black marble in the vase. Let's see if we can reproduce this in our quantum 
mechanical description. Finding a white marble can be described mathematically by a 
projection operator |0) <0| (see Eq. (2.24)). We need to include this operator in the trace 
over Alice's marble's Hilbert space: 

TrA(|0)A<0lp) mn| f _„. 
PB= Tr(|0>A(0|p) - |1>(11 ' (5 " 7) 

which we set out to prove: if Alice finds that when she sees that her marble is white, she 
describes the state of Bob's marble as black. Based on the setup of this experiment, Alice 
knows instantaneously what the state of Bob's marble is as soon as she looks at her own 
marble. There is nothing spooky about this; it just shows that the marbles held by Alice 
and Bob are correlated. 

Next, consider a second experiment: By some procedure, the details of which are not 
important right now, Alice and Bob each hold a two-level system (a qubit) in the pure state 

V2 

Since |1,0) and |0, 1) are valid quantum states, by virtue of the first postulate of quantum 
mechanics \*¥)ab is also a valid quantum mechanical state. It is not difficult to see that 
these systems are also correlated in the states |0) and |1): When Alice finds the value "0", 
Bob must find the value "1", and vice versa. We can write this state as a density operator 

p = -(|0,1) + |1,0»( (0,1| + <1,0|) 

= \ (|0, 1) (0, 1| + |0, 1) <1,0| + |1,0) (0, 1| + |1,0) (1,01) . (5.9) 

Notice the two extra terms with respect to Eq. (5.5). If Alice now traces out Bob's system, 
she finds that the state of her marble is 

p A = Tr B (p) = - |0) (0| + i |1) (II • (5.10) 

In other words, even though the total system was in a pure state, the subsystem held by 
Alice (and Bob, check this) is mixed! We can try to put the two states back together: 



PA ® PB = 



^10) (0| + \ |1) (1| 



\ |0) (0| + \ ID (II 



= ^(|0,0) (0,01 + |0, 1) (0, 1| + 11,0) (1,01 + |1,1) (1, II), (5.11) 
4 
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but this is not the state we started out with! It is also a mixed state, instead of the pure 
state we started with. Since mixed states mean incomplete knowledge, there must be 
some information in the combined system that does not reside in the subsystems alone\ 
This is called entanglement. 

Entanglement arises because states like (|0, 1) + 1 1, 0))/\/2 cannot be written as a tensor 
product of two pure states ® \<p). These latter states are called separable. In general a 
state is separable if and only if it can be written as 

P = LPjPf^pT- (5.12) 

j 

Classical correlations such as the black and white marbles above fall into the category of 
separable states. 

So far, we have considered the quantum states in the basis {|0> , |1>}. However, we can 
also describe the same system in the rotated basis {|+>} according to 

|0> = — and |1> = — . (5.13) 

y/2 y/2 

The entangled state \ x ¥)ab can then be written as 

10,1) + H,0) !+,+)-!-,-) 



y/2 y/2 



(5.14) 



which means that we have again perfect correlations between the two systems with re- 
spect to the states |+) and |— ). Let's do the same for the state p in Eq. (5.5) for classically 
correlated marbles. After a bit of algebra, we find that 

P = ^(l++X++I + I+-X+-I + I-+X-+I + I— X— I 
-I++X— I -I— X++I-I+-X-+I-I-+X+-I). (5.15) 

Now there are no correlations in the conjugate basis {|+>}, which you can check by cal- 
culating the conditional probabilities of Bob's state given Alice's measurement outcomes. 
This is another key difference between classically correlated states and entangled states. 
A good interpretation of entanglement is that entangled systems exhibit correlations that 
are stronger than classical correlations. We will shortly see how these stronger correla- 
tions can be used in information processing. 

We have seen that operators, just like states, can be combined into tensor products: 

A®B|0)® \y)=A\(f))®B\if/) . (5.16) 
And just like states, some operators cannot be written as A ®B: 

C = Y J A k ®B k . (5.17) 

k 
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This is the most general expression of an operator in the Hilbert space J€\ ® J€i- In Dirac 
notation this becomes 

C = E (pjklm\(l>j)(<Pk\®\<Pl)(<Pm\= E (Pjklm\(l>j,(l>l)((l>k,<pm\- (5.18) 
jklm jklm 

As an example, the Bell operator is diagonal on the Bell basis: 

± I0.0>^| 1 . 1 > I0.1)±|1.0> 

1 \/2 1 \/2 

The eigenvalues of the Bell operator are not important, as long as they are not degenerate 
(why?). A measurement of the Bell operator projects onto an eigenstate of the operator, 
which is an entangled state. Consequently, we cannot implement such composite measure- 
ments by measuring each subsystem individually, because those individual measurements 
would project onto pure states of the subsystems. And we have seen that the subsystems 
of pure entangled states are mixed states. 

A particularly useful technique when dealing with two systems is the so-called Schmidt 
decomposition. In general, we can write any pure state over two systems as a superposi- 
tion of basis states: 

\ x v) = LLcjk\<Pj) A \n) B , (5.20) 

where d A and d B are the dimensions of the Hilbert spaces of system A and B, respectively, 
and we order the systems such that d B >d A - It turns out that we can always simplify this 
description and write \W) as a single sum over basis states. We state it as a theorem: 

Theorem: Let |^) be a pure state of two systems, A and B with Hilbert spaces J€ A 
and J€ B of dimension d A and d B > d A , respectively. There exist orthonormal basis 
vectors \aj) A for system A and \ bj) B for system B such that 

m=Y,lj\aj) A \bj) B , (5.21) 

j 

with real, positive Schmidt coefficients Xj, and L/A^ = 1. This decomposition is 
unique, and the sum runs at most to d A , the dimension of the smallest Hilbert space. 
Traditionally, we order the Schmidt coefficients in descending order: Ai > A2 > .... 
The total number of non-zero A; is the Schmidt number. 

The proof can be found in many graduate texts on quantum mechanics and quantum 
information theory. 

Given the Schmidt decomposition for a bi-partite system, we can immediately write 
down the reduced density matrices for the sub-systems: 

PA = Tr B (m = ^ A* \aj) A (aj\ and p B = Tr A (|^> QV\) = J^X 2 j \bj) B (bj\ . 

j j 

(5.22) 
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The basis states and \bj) B may have completely different physical meanings; here 

we care only that the states of the decomposition can be labelled with a single index, as 
opposed to two indices. 

Conversely, when we have a single system in a mixed state 

P = LPj\<*j)M> ( 5 - 23 ) 

j 

we can always construct a pure state I 1 ?) that obeys (Xj = y/pj) 

l^> = E%|a i ,6y>, (5.24) 

j 

By virtue of the Schmidt decomposition. The state | M?) is called the purification of p. Since 
many theorems are easier to prove for pure states than for mixed states, purifications can 
make our work load significantly lighter. 

When there is more than one non-zero Xj in Eq. (5.24), the state |Y) is clearly entan- 
gled: there is no alternative choice of Xj due to the uniqueness of the Schmidt decompo- 
sition that would result in A' x = 1 and all others zero. Moreover, the more uniform the 
values of Xj, the more the state is entangled. One possible measure for the amount of 
entanglement in I^F) is the Shannon entropy H 

H = -Y J rflog 2 X 2 j . (5.25) 

j 

This is identical to the von Neumann entropy S of the reduced density matrix p of | *¥) 
given in Eq. (5.23): 

S(p)=-Tr(plog 2 p). (5.26) 

Both entropies are measured in classical bits. 

How do we find the Schmidt decomposition? Consider the state I^P) from Eq. (5.20). 
The (not necessarily square) matrix C with elements cjk needs to be transformed into a 
single array of numbers Xj. This is achieved by applying the singular-value decomposition: 

Cjk=Y, U ji d H V ik> (5.27) 
i 

where Ujt and Vik are elements of unitary matrices U and V, respectively, and da is a 
diagonal matrix with singular values A; . The vectors in the Schmidt decomposition become 

\ a i) =Y. u Ji\ ( Pj) and \bi)=Y* v ik\Vh) - (5.28) 

j k 

This is probably a good time to remind ourselves about the singular-value decomposition. 
All we need to do is find U and V, the rest is just matrix multiplication. To find U, we 
diagonalize CC^ and find its eigenvectors. These form the columns of U. Similarly, we 
diagonalize C^C and arrange the eigenvectors in columns to find V. If C is an n x m 
matrix, U should be n x n and V should be m x m. 
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5.3 Quantum teleportation 

Probably the most extraordinary use of the quantum correlations present in entanglement 
is quantum teleportation. Alice and Bob share two entangled qubits, labelled 2 (held by 
Alice) and 3 (held by Bob), in the state (|0,0> 23 + |l,l) 2 3)A/2. In addition, Alice holds a 
qubit in the state 

l^^alOh + jSHh. (5.29) 

The object of quantum teleportation is to transfer the state of qubit 1 to qubit 3, without 
either Alice or Bob gaining any information about a or /3. To make things extra hard, the 
three qubits must not change places (so Alice cannot take qubit 1 and bring it to Bob). 

Classically, this is an impossible task: we cannot extract enough information about a 
and j6 with a single measurement to reproduce \y) faithfully, otherwise we could violate 
the no-cloning theorem. However, in quantum mechanics it can be done (without violating 
no-cloning). Write the total state as 

\%) = \xf/) 1 10 + ) 23 = -J= (a |000> + a |011> + j8 1 100) + j8 1111>) . (5.30) 

Alice now performs a Bell measurement on her two qubits (1 and 2), which project them 
onto one of the Bell states |^ >± >i 2 or | x * /± >i 2 - We write 100), 1 01), 1 10), and 111) in the Bell 
basis: 



100) 
101) 
HO) 



|<D + ) + |<p-) 
| x I /+ ) + l v I / ") 



111) = J . (5.31) 

V2 

We can use these substitutions to write the state \%) before the measurement as 

\x) = \ [a |<J> + > 12 I0) 3 + a |0-)i2 10) 3 + a |^ + ) 12 11) 3 + a |^") 12 11) 3 
+13 \W + ) 12 10) 3 - j8 |¥">i2 |0) 3 + ft |0 + ) 12 11) 3 - |0-)i2 IDs] 
= ^[|O + )(a|O) + i 0|l))+|O-)(a|O)- i 0|l)) 

+ i/5 10) + a ID) + (j8 10) - all))] . (5.32) 
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Alice finds one of four possible outcomes: 



vp- 



Tr 12 (\^ + )(^ + \\ X )(x\) - |^) 3 = a|0> + i 6|l>, 

Tr 12 (\^-)(^-\\x)(x\) - |^>3 = a|0>-j6|l>, 

Tr 12 (\W + )(W + \\x)(x\) - |^> 3 = «ll> + )6|0>, 

Tn 2 (|^-><^-|UXl|) - |^>3 = «|1> - yS|0> . (5.33) 



From these outcomes, it is clear that the state held by Bob is different for the different 
measurement outcomes of Alice's Bell measurement. Let this sink in for a moment: After 
setting up the entangled state between Alice and Bob, who may be literally light years 
apart, Bob has done absolutely nothing to his qubit, yet its state is different depending on 
Alice's measurement outcome. This suggests that there is some instantaneous communi- 
cation taking place, possibly violating causality! 

In order to turn the state of Bob's qubit into the original state, Alice needs to send the 
measurement outcome to Bob. This will take two classical bits, because there are four 
outcomes. The correction operators that Bob need to apply are as follows: 

+ :D, <D":Z, ¥ + :X, W~:ZX. (5.34) 

So in each case Bob needs to do something different to his qubit. To appreciate how re- 
markable this protocol is, here are some of its relevant properties: 

1. No matter is transported, only the state of the system; 

2. neither Alice nor Bob learns anything about a or /3; 

3. any attempt to use quantum teleportation for signaling faster than light is futilel 
Exercises 

1. Derive Eq. (5.15), and show that pb = 0/2 when Alice's qubit is projected onto |+). 

2. The state of a bipartite system is given by 

|¥> = -^=(|00> + 3|01> + 2|10> + 5|11> + |12>) . 
V40 



The first system is described by a two-dimensional Hilbert space, and the second by 
a three-dimensional Hilbert space. 

(a) Show that states with Schmidt number 1 are separable. 

(b) Without doing elaborate calculations, what is the Schmidt number of l^)? 

(c) Find the Schmidt decomposition of 
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3. Quantum teleportation. Write the Bell states as \ifr nm ) = (|0,0© n) + (-l) m |1, 1 ffire))/\/2, 
where © denotes addition modulo 2 and n,m = 0, 1. 

(a) Write \y/ nm ) in terms of \y/oo) and the Pauli operators X and Z acting on the 
second qubit. 

(b) Using the shared Bell state \ f nm ) between Alice and Bob, and the two-bit mea- 
surement outcome ij,k) for Alice's Bell measurement, determine the correction 
operator for Bob. 

(c) We now generalize to A^-dimensional systems. We define the N 2 entangled 
states 

\Vnm) = -^= N £e 2 * ijn/N \j,j®m) , 
VN j=0 

where n®m = n + m mod N. Prove that this is an orthonormal basis. 

(d) Give the teleportation protocol for the A^-dimensional Hilbert space. 

(e) What is Bob's state before he learns Alice's measurement outcome? 

4. Imperfect measurements. 

(a) A two-qubit system (held by Alice and Bob) is in the anti-symmetric Bell state 
1^"). Calculate the state of Bob's qubit if Alice measures her qubit in the state 
|0>. Hint: write the measurement procedure as a partial trace over Alice's qubit. 

(b) Now Alice's measurement is imperfect, and when her apparatus indicates "0", 
there was actually a small probability p that the qubit was projected onto the 
state |1>. What is Bob's state? 

(c) If Alice's (imperfect) apparatus has only two measurement outcomes, what will 
Bob's state be if she finds outcome "1"? 
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6 Evolution of Open Quantum Systems 

We have considered mixed states, where the experimenter has incomplete information 
about the state preparation procedure, and we have also seen that mixing arises in a sys- 
tem when it is entangled with another system. The combined system can still be pure, 
but the subsystem has become mixed. This phenomenon arises often when we want to de- 
scribe systems that have some interaction with their environment. The interaction creates 
entanglement, and the system taken by itself evolves from a pure state to a mixed state. 
Such a system is called "open", since it can leak quantum information to the environment. 
The theory of open quantum systems revolves around the so-called Lindblad equation. 



6.1 The Lindblad equation 

Next, we will derive the Lindblad equation, which is the direct extension of the Heisen- 
berg equation for the density operator, i.e., the mixed state of a system. We have seen in 
Eq. (4.30) that formally, we can write the evolution of a density operator as a mathematical 
map S, such that the density operator p transforms into 

p^p' = g( P ) = ^A k pAl (6.1) 

k 

where the Ak are the Kraus operators. Requiring that p' is again a density operator 
(Tr(p') = 1) leads to the restriction Y.kA\A k = 0. 

We want to describe an infinitesimal evolution of p, in order to give the continuum 
evolution later on. We therefore have that 

p' = p + 6p = J^A k pAl. (6.2) 

k 

Since 8p is very small, one of the Kraus operators must be close to the identity. Without 
loss of generality we choose this to be Aq, and then we can write 

A = D + (L - iK)8t and A k =L k Vsi, (6.3) 

where we introduced the Hermitian operators Lq and K, and the remaining Lk are not 
necessarily Hermitian. We could have written Aq = D + Lo<5£ and keep Lq general (non- 
Hermitian as well), but it will be useful later on to explicitly decompose it into Hermitian 
parts. We can now write 

A pAl = p+ [(Lo - iK)p + p(L + iK)\ 8t + 0(6t 2 ) 
A k pAl=L k pL\8t. (6.4) 

We can substitute this into Eq. (6.2), to obtain up to first order in 8t 



8p 



{L p + pL ) - i(Kp -pK)+Y, L kpL 



t 
k 



St. (6.5) 
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We now give the continuum evolution by dividing by 6t: 

^ = -i[K, p] + {L , p}+£ L kP L\ , (6.6) 

where {A,B} = AB +BA is the anti-commutator of A and B. We are almost there, but we 
must determine what the different terms mean. Suppose we consider the free evolution of 
the system. Eq. (6.6) must then reduce to the Heisenberg equation for the density operator 
p in Eq. (4.7), and we see that all including Lq are zero, and K is proportional to the 
Hamiltonian K = H/h. Again from the general property that Tr(p) = 1 we have 



Tr 



\dt) 2 W 



0-Z, = --2>U,fc. (6.7) 



This finally leads to the Lindblad equation 

\lH,p] + W [ZL kP L\ - {L\L k ,p}) . (6.8) 



dp 

dt ih' 



The operators are chosen such that they model the relevant physical processes. This 
may sound vague, but in practice it will be quite clear. For example, modelling a transition 
|1) — ► |0) without keeping track of where the energy is going or coming from will require a 
single Lindblad operator 

L = y|0><l|, (6.9) 

where 7 is a real parameter indicating the strength of the transition. This can model both 
decay and excitations. 

6.2 Positive and completely positive maps 

We considered the evolution of the density operator under a family of Kraus operators in 
Eq. (4.30): 

p^p' = S(p) = Y J A kP Al, (6.10) 

k 

where Y.k A^Ak = (that is, 8 is trace-preserving). When S transforms any positive opera- 
tor into another positive operator, we call it a positive map. We may be tempted to conclude 
that all positive maps correspond to physically allowed transformations. After all, it maps 
density operators to density operators. Unfortunately, Nature (or Mathematics?) is not 
that tidy. 

Consider the transpose of the density operator p — ► p T , which acts according to 
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You can verify immediately that the trace is preserved in this operation (check this!), 
and p T is again a positive operator since the eigenvalues are identical to those of p. For 
example, consider the qubit state (|0> + i |l))/\/2. The density operator and its transpose 
are 

1(1 -i) , T 1 
p= 2\i ij and p =2 

The transpose therefore corresponds to the state (|0) -i\l))/y2. Now consider that the 
qubit is part of an entangled state (|00) + |ll))/\/2. The density operator is given by 

p = \ (|00> <00| + |00> (111 + 111) <00| + |11> (111) , (6.13) 

and the partial transpose on the first qubit is 

P T = ^(100) <00| + |10) <01| + |01) (10| + |11) (111) . (6.14) 

The eigenvalues of p are all positive, but p T has a negative eigenvalue! So p T cannot be 
a density operator. Consequently, it is not correct to say that positive maps correspond to 
physical processes. We need to put another restriction on maps. 

From the example of the partial transpose, we can deduce that maps must not only 
be positive for the system S that they act on, but also positive on larger systems that 
include S as a subsystem. When this is the case, we call the map completely positive. 
There is a very important theorem in mathematics, called Kraus' Representation Theorem, 
which states that maps of the form in Eq. (6.10) with the restriction that LjkAtA& = II is a 
completely positive map, and moreover, that any completely positive map can be expressed 
in this form. 

6.3 The Choi-Jamiofkowski isomorphism 

There is a very interesting and useful relation between vectors and linear operators in 
Hilbert space that goes by the name of the Choi- Jamiolkow ski isomorphism. Compare the 
operator 

\j){k\EM (6.15) 

with the vector 

\j)®\k)eje®je. (6.16) 

By turning the bra into a second ket we switch between Eq. (6.15) and Eq. (6.16). The 
Choi-Jamiolkowski isomorphism says that the mathematical structure of linear operators 
in J€ is exactly the same as that of the vectors in J€%J€. This can be seen intuitively by 



(6.12) 
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viewing the operators \j) (k\ as basis vectors in J€ with dimension d. For the case where 
d = 3, the basis vectors can be written as nine matrices 





0; 



(0 1 0^ 


,0 0; 



(0 1\ 


,0 0; 



(0 







1 








to 





oj 



(0 0^ 

lo lj 



You can see that we can construct any operator by multiplying these matrices with com- 
plex numbers and adding them. However, there is nothing special about the square ar- 
rangement, and we can place the zeros and ones in a vertical, vectorial arrangement. This 
is now a vector in a complex Hilbert space of dimension d 2 . This isomorphism allows you 
to solve problems in operator space that are hard in the vector space, and vice versa. 
For a general operator U the isomorphism works as follows. Let 



u = Y J u jk \j) {k\ 



(6.17) 



for some basis \j) in J€. Applying the isomorphism, i.e., turning the bra into a ket will 
define a new vector 



(6.18) 



This will generally not be a normalised vector, but that does not concern us here. We can 
also go the other way around, and define an operator given a bi-partite vector: 



m = E \J) \k) - = E cjk \j) (k i 

jk jk 



(6.19) 



In a way, this is all there is to the Choi-Jamiolkowski isomorphism. 

But what does it mean? We can compose two operations into a new operation, but 
we can't do this for vectors, so it is not clear where the isomorphism takes us, physically 
speaking. To figure this out we will write the isomorphism in a slightly different way. 
Consider again the operator U: 



u=Y,u jk \j)(k\ -* Z u jk\J)\k)=L\ML u jk\k)\. 
jk CJ jk j \k } 

However, Y.k Ujk \k) is just U \j), and we can write this as 

E L/> (e^ i*>j = (»a ® u B )Z \j) A \J)b = ("a ® u B ) |o + ) ab , 



(6.20) 



(6.21) 



where we defined |$ + ) = and the subscript on U determines the subspace it acts 

on. So the isomorphism in one direction can be summarised by 



U - |0 [7 >=(l A ®C7 B )|cD + ) 

CJ 



(6.22) 



Section 6: Evolution of Open Quantum Systems 



51 



where the state \Q>u) is isomorphic to U. 

The opposite direction of \<f>u) — ► U needs a little bit more work. We start with an 
operator acting on \xff) (\ff\ and show that it equals 

U A \y/) A (yr\ u\ = BC <$ + | [\®u)ab (®u\ ® W) c M) \® + )bc ■ ( 6 - 23 > 

Since we need to take \0>u) to U, we start with the right-hand side (RHS) and rearrange 
so we end up with the left-hand side: 

RHS = bc <<J> + 1 [\®u)ab <®u\ ® k> c \® + )bc 

= bc (® + 1 [<U A ® lb) <<P + 1 (t/ A 8 b) s |^) c <^|] \® + ) bc 

= E BcO',j'K^A®DB)|^,^,1/'> ABC <^^^|(C / i® ll B)N^>BC 

= E^a I^>a i(k\y))c {l\u\{(y\l))c 
ki 

= U A \V){V\U\. (6.24) 

In the second line we used the isomorphism of Eq. (6.22) and in the third line we expanded 
all |0 + ). In the fourth line we evaluated the inner products of vectors in the Hilbert space 
J€b and used <J\k) = 8jk- In the fifth line we recognise that the inner products ((k\i(/))c are 
just the amplitudes of \y/) for basis vector \k), and the final result follows. 

We can see from Eq. (6.23) that for the specific choice of U = I this is the teleportation 
protocol (up to a local correction). The state \y/) on system C is transferred to system 
A, assuming that the measurement outcome of the Bell measurement was |$ + ). Using a 
nontrivial U then allows us to create the resource state \<f>u) f° r teleportation (instead of 
|<t> + )), and the result is that we teleport the state \i[r) with an added operation U. This is 
called gate teleportation. The name comes from quantum information theory, where U is 
typically a quantum gate on a qubit or a set of qubits. 

This trick is particularly useful when \y/) carries fragile quantum information 5 , and 
when applying U is difficult and noisy. Instead of applying U straight to \y/), and possibly 
destroy it, we can create the state |Ojy). Since this is a known state (as opposed to \y/)) 
we can try to create it as many times as we want, until we get it right. Only then do we 
employ the gate teleportation protocol. Eq. (6.23) then ensures that we obtain the state 
U \y/). This is the fundamental principle behind measurement-based quantum computing, 
which has typically higher fault-tolerance thresholds than the circuit model of quantum 
computing. 



5 Which, when it is gone it cannot be retrieved thanks to the no-cloning theorem. 
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Exercises 

1. Show that Y.k,A\Ak = D, prove that any non-Hermitian square matrix can be written 
as A + iB, with A and B Hermitian, and prove that Lq = -^^.k^oL^Lk- 

2. Consider a two-level system (|0>, |1>) that has a dephasing process, modelled by the 
Lindblad operators L\ = y|0) <1| andZ/2 = y|l) (0|. 

(a) write down the Lindblad equation (choose H = for simplicity). 

(b) Calculate the evolution of the pure states |0) and |+> at t = 0. Hint: write the 
density matrix in the Pauli matrix basis {i,X,Y ,Z}. What can you say about 
the equilibrium state of the system? 

(c) Calculate and plot the entropy S(p) of the state p(t) as a function of 7 and t. 

3. Calculate the eigenvalues of p T in Eq. (6.14). 
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7 Orbital Angular Momentum and Spin 

Angular momentum plays an important role in quantum mechanics, not only as the orbital 
angular momentum of electrons orbiting the central potentials of nuclei, but also as the 
intrinsic magnetic moment of particles, known as spin, and even as isospin in high-energy 
particle physics. 

7.1 Orbital angular momentum 

From classical physics we know that the orbital angular momentum of a particle is given 
by the cross product of its position and momentum 

L = rxp or L i =e ij } l rjP} l , (7.1) 

where we used Einstein's summation convention for the indices. In quantum mechanics, 
we can find the operator for orbital angular momentum by promoting the position and 
momentum observables to operators. The resulting orbital angular momentum operator 
turns out to be rather complicated, due to a combination of the cross product and the fact 
that position and momentum do not commute. As a result, the components of orbital mo- 
mentum do not commute with each other. When we use [rj,pk] = ihdjk, the commutation 
relation for the components of L becomes 

[Li,Lj] = ih€ij k L k . (7.2) 

A set of relations like this is called an algebra, and the algebra here is called closed since 
we can take the commutator of any two elements L; and Lj, and express it in terms of 
another element Another (simpler) closed algebra is [x,p x ] = M and [x,D] = [p x ,0] = 0. 

Since the components of angular momentum do not commute, we cannot find simulta- 
neous eigenstates for L x , L y , and L z . We will choose one of them, traditionally denoted by 
L z , and construct its eigenstates. It turns out that there is another operator, a function of 
all LjS, that commutes with any component Lj, namely L 2 = L 2 + L 2 + L 2 . This operator is 
unique, in that there is no other operator that differs from L 2 in a nontrivial way and still 
commutes with all L;s. We can now construct simultaneous eigenvectors for L z and L 2 . 

Since we are looking for simultaneous eigenvectors for the square of the angular mo- 
mentum and the 2 -component, we expect that the eigenvectors will be determined by two 
quantum numbers, I, and m. First, and without any prior knowledge, we can formally 
write down the eigenvalue equation for L z as 

L z \l,m) = mh\l,m) , (7.3) 

where m is some real number, and we included H to fit the dimensions of angular mo- 
mentum. We will now proceed with the derivation of the eigenvalue equation for L 2 , and 
determine the possible values for I and m. 

From the definition of L 2 , we have L 2 -L 2 = L 2 +L 2 , and 



{l,m\\} -L\\l,m) = {l,m\L 2 x +L 2 y \l,m)>0. 



(7.4) 
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The spectrum of L z is therefore bounded by 

l<m<l (7.5) 

for some value of I. We derive the eigenvalues of L 2 given these restrictions. First, we 
define the ladder operators 

L+ =L x ±iL y with L-=L\. (7.6) 

The commutation relations with L z and L 2 are 

[L z ,L ± ] = +hL ± , [L + ,L-] = 2hL z , [L ± ,L 2 ] = 0. (7.7) 

Next, we calculate L Z (L + \l,m)): 

L Z (L+ \l,m)) = (L+L z + [L z ,L+])\l,m) = mhL+ \l,m) + HL+ \l,m) = (m + 1)HL+ \l,m) . (7.8) 

Therefore L+\l,m) oc \l,m + l). By similar reasoning we find that L-\l,m) oc \l,m — 1). 
Since we already determined that -I < m < I, we must also require that 

L + |Z,Z> = and L_|Z,-Z>=0. (7.9) 

Counting the states between -/ and +1 in steps of one, we find that there are 21 + 1 
different eigenstates for L z . Since 21 + 1 is a positive integer, I must be a half-integer 
(Z = 0,|,l,|,2,...). Later we will restrict this further to I = 0, 1,2, . . . 

The next step towards finding the eigenvalues of L 2 is to calculate the following iden- 
tity: 

L_L+ = (L x - iL y )(L x + iL y ) = L 2 +L 2 + i[L x ,L y ] = L 2 -L 2 - hL z . (7.10) 

We can then evaluate 

L-L+\l,l)=0 => (L 2 -L 2 z -hL z )\l,l) = -L 2 \l,l)-(l 2 + l)h 2 \l,l)=0. (7.11) 

It is left as an exercise (see exercise lb) to show that 

Iu 2 \l,m) = l(l + l)h 2 \l,m) . (7.12) 

We now have derived the eigenvalues for L z and L 2 . 

One aspect of our algebraic treatment of angular momentum we still have to determine 
is the matrix elements of the ladder operators. We again use the relation between L+, and 
L z and L 2 : 

l 

(l,m\L-L + \l,m)= £ (l,m\L- \l,j) (l,j\L + \l,m) . (7.13) 

j=-l 



Section 7: Orbital Angular Momentum and Spin 



55 



Both sides can be rewritten as 

(l,m\\? -L\-hL z \l,m) = (l,m\L-\l,m + 1) (l,m + l\L+\l,m) , (7.14) 
where on the right-hand-side we used that only the m + 1-term survives. This leads to 

[1(1 + l)-m(m + l)]h 2 = \(l,m + 1\L+ \l,m) | 2 . (7.15) 
The ladder operators then act as 

L+\l,m) = h\/l(l + 1) - m(m + l)\l,m + 1) , (7.16) 

and 

L-\l,m) = hy/l(l + l)-m(m-l)\l,m-l) . (7.17) 



We have seen that the angular momentum L is quantized, and that this gives rise 
to a discrete state space parameterized by the quantum numbers I and m. However, we 
still have to restrict the values of I further, as mentioned above. We cannot do this using 
only the algebraic approach (i.e., using the commutation relations for Lj), and we have to 
consider the spatial properties of angular momentum. To this end, we write Lj as 



L; = -ihe 



ijk 



Xj dx k J ' 



(7.18) 



which follows directly from the promotion of r and p in Eq. (7.1) to quantum mechanical 
operators. In spherical coordinates, 



x 2 + y 2 + z 2 , = arctan |— j, 6 = arctan 



\/ x 2 +y 2 



(7.19) 



the angular momentum operators can be written as 



-ih 



L v = -ih 



-sind) cot0cos</> — , 

d d 
cosd) cote/ sin (/> — 



d(p 



-h Y 



1 d ( d 
smtf 



sinddO 



+ 



1 



dd ) sir? eddy* 



The eigenvalue equation for L z then becomes 



d 



L z y/(r,d,d)) = -ih—iff(r,6,dj) = mhtyir ,0 ,<p) 
ad) 



(7.20) 



(7.21) 
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We can solve this differential equation to find that 

y{r,e,<p) = ar,e)e im(p . (7.22) 

A spatial rotation over 2n must return the wave function to its original value, because 
y/(r,9,(p) must have a unique value at each point in space. This leads to y/(r,8,<p + 2n) = 
y/(r,9,(p) and 

e im^+2n) = e im<l> ^ Qr e 2nim = 1 _ (7 23) 

This means that m is an integer, which in turn means that I must be an integer also. 



7.2 Spin 

For orbital angular momentum we found that 21 + 1 must be an integer, and moreover 
the spatial properties of the wave function force I to be an integer as well. However, we 
can also construct states with half-integer I, but this must then be an internal degree of 
freedom. This is called spin angular momentum, or spin for short. We will show later in 
the course that the spin observable is interpreted as an intrinsic magnetic moment of a 
system. 

To describe spin, we switch from L to S, which is no longer related to r and p. The 
commutation relations between the components Si are the same as for Li, 

[Si,Sj] = ihe ijk S k , (7.24) 

so S and L obey the same algebra. The commutation relations between S and L, r, and p 
vanish: 



[Si,Lj] = [Si, rj ] = [Si, Pj ] = 0. (7.25) 

Therefore, spin generates a whole new vector space, since it commutes with observables 
that themselves do not commute (like rj and pj), and it is independent of the spatial 
degrees of freedom. 

Since the commutation relations for S (its algebra) are the same as for L, we can 
immediately copy the algebraic structure of the eigenstates and eigenvalues: 

1 3 

S z \s,m s ) = m s h\s,m s ) , with s = 0, -, 1, -,2,. . . 

S 2 |s,m s > = s(s + l)h 2 \s,m s ) . (7.26) 

When s = |, the system has two levels (a qubit) with spin eigenstates \ \,\) and ||,-|). 
We often write m s = +^ =} ("up") and m s = -| =| ("down"), which finds its origin in the 
measurement outcomes of electron spin in a Stern-Gerlach apparatus. 

Now that we have introduced a whole new vector space related to spin, how do we 
write the wave function of a particle with spin? Without spin, the wave function is a 
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normal single-valued function if/(r,t) = {r\y/(t)) of space and time coordinates. Now we 
have to add the spin degree of freedom. For each spin (f or J when s = |), we have a wave 
function y/](r, t) for the particle with spin up, and i^j(r,i) for the particle with spin down. 
We can write this as a vector: 



V(r,t) = 



(7.27) 



The spin degree of freedom generates a vector space, after all. The vector f is called a 
spinor. 

Expectation values are evaluated in the usual way, but now we have to sum over the 
spin degree of freedom, as well as integrate over space. For example, the probability of 
finding a particle with spin up in a region Q of space is given by 

p(lQ)= £ [ dr6 msA \yi ms (r,t)\ 2 = [ dr|^ t (r,£)| 2 , (7.28) 

and the expectation value of finding a particle with any spin in a region D. of space is given 
by 

p(C)= £ [ dr\y ms (r,t)\ 2 . (7.29) 

The normalization of the spinor is such that 

£ [ dr\y ms (r,t)\ 2 = l, (7.30) 

where V is the entire space available to the particle (this may be the entire universe, or 
the volume of a box with impenetrable walls, etc.). 

If spin is represented by (2s + l)-dimensional spinors (vectors), then spin transforma- 
tions (operators) are represented by (2s + 1) x (2s + 1) matrices. In the two-dimensional 
case, we have by construction: 



H 



o) and ^ > = -2(l)' 



(7.31) 



which means that the matrix representation of S z is given by 

fl(l 



S..-\ _J. (7.32) 

Next, the ladder operators act according to 

S + lt>=0, S + |l> = fc|t>, S_lt> = ft|l>, S_I1> = 0, (7.33) 
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which leads to the matrix representation 

(0 1) 



S + = h 



[o o 



and S- = h 



From S+ - S x ± iS v we can then deduce that 



Sr. — 



h 



o 1 
[1 



and S 



h 



y 





U o 



-i 



(7.34) 



[i ) ' 



(7.35) 



We often define Si = \hai, where Oi are the so-called Pauli matrices. Previously, we have 
called these matrices X, Y, and Z. The commutation relations of the Pauli matrices are 



2 ' 2 



Ok 



[ai,aj] = 2ieijk(Jk or 
Other important properties of the Pauli matrices are 

{(Tj, cry} = (TiUj + a jO[ = 26iji (anti-commutator). 



(7.36) 



(7.37) 



They are both Hermitian and unitary, and the square of the Pauli matrices is the identity: 
a 2 : = D. Moreover, they obey an "orthogonality" relation 



1 



The proof of this statement is as follows: 

OiOj = OiOj + OjOi-OjOi = {ai,aj}-aj(Ji = 28ij\\-ajUi 
Taking the trace then yields 

Tr(25y0 - ajUi) = Triaiaj) = Tria^i), 

or (using Tr(D) = 2) 

2 r Yr{a j a i ) = A8 ij , 



(7.38) 



(7.39) 



(7.40) 



(7.41) 



which proves Eq. (7.38). If we define cjq = D, we can extend this proof to the four-dimensional 
case 



1. 



^Tr^crv)^^, 



(7.42) 



with /i, v = 0,1, 2, 3. 

We can then write any 2x2 matrix as a sum over the two-dimensional Pauli operators: 



A* 



(7.43) 
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since 

^Tr(Aa v )= -Tr £a M a^ff v = -^o^Tr^ffv) = ^a^ v = a v . (7.44) 

The Pauli matrices and the identity matrix form a basis for the 2x2 matrices, and we can 
write 



A = aoD + a-cr 
where we used the notation cr = (a x ,a y ,a z ). 



'a + a z a x -ia y 
a x + ia y ao~a z 



(7.45) 



7.3 Total angular momentum 

In general, a particle may have both spin and orbital angular momentum. Since L and 
S have the same dimensions, we can ask what is the total angular momentum J of the 
particle. We write this as 

J = L + S = L®D + D®S, (7.46) 

which emphasizes that orbital and spin angular momentum are described in distinct 
Hilbert spaces. 

Since [Li,Sj] = 0, we have 

[Ji,Jj] = [Li+Si,Lj + Sj] = [Li,Lj] + [Si,Sj] = iheijkL k + iheijkSk = iheij k (L k + S k ) 
= ihei jk J k . 

{IAD 

In other words, J obeys the same algebra as L and S, and we can immediately carry over 
the structure of the eigenvalues and eigenvectors from L and S. 

In addition, L and S must be added as vectors. However, only one of the components 
of the total angular momentum can be sharp (i.e., having a definite value). Recall that 
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I and s are magnitudes of the orbital and spin angular momentum, respectively. We can 
determine the extremal values of J, denoted by + j, by adding and subtracting the spin 
from the orbital angular momentum, as shown in Figure 3: 

\l-s\<j<l+s. (7.48) 

For example, when 1 = 1 and s = \, the possible values of j are j -\ and j ' = |. 

The commuting operators for J are, first of all, J 2 and J z as we expect from the algebra, 
but also the operators L 2 and S 2 . You may think that S z and L z also commute with these 
operators, but that it not the case: 

[J 2 ,L Z ] = [(L + S) 2 ,L 2 ] = [L 2 + 2L • S + S 2 ,L 2 ] = 2[L,L 2 ] • S ? . (7.49) 

We can construct a full basis for total angular momentum in terms of J 2 and J z , as before: 

J 2 \j,mj) = h 2 j(j +l)\j,mj) and J z \j,mj) = mjh \j,rrij) . (7.50) 

Alternatively, we can construct spin and orbital angular momentum eigenstates directly 
as a tensor product of the eigenstates 

L 2 \l,m) \s,m s ) = H 1(1 + l)\l,m) \s,m s ) and L z \l,m) \s,m s ) = mh \l,m) \s,m s ) , (7.51) 

and 

S 2 \l,m) \s,m s ) = h 2 s(s + 1) \l,m) \s,m s ) and S z \l,m) \s,m s ) = m s h\l,m) \s,m s ) . (7.52) 

Since the L z and S z do not commute with J 2 , the states \ j,nij) are not the same as the 
states \l,m) \s,m s ). 

7.4 Composite systems with angular momentum 

Now consider two systems, 1 and 2, with total angular momentum Ji and J2, respectively. 
The total angular momentum is again additive, and given by 

J = J 1+ J 2 = J"i <8> D + D ® J 2 . (7.53) 

Completely analogous to the addition of spin and orbital angular momentum, we can con- 
struct the commuting operators J 2 , J z , J 2 , and J|, but not J\ z and J<iz- Again, we con- 
struct two natural bases for the total angular momentum of the composite system, namely 

{\j,m)} or {|j'i,mi> ® l72,m2» = {|j'i,j2,mi,m 2 >}. (7.54) 

We want to know how the two bases relate to each other, because sometimes we wish to 
talk about the angular momentum of the composite system, and at other times we are 
interested in the angular momentum of the subsystems. Since the second basis (as well as 
the first) in Eq. (7.54) forms a complete orthonormal basis, we can write 

\j,m)= Y. U1J2, mi, m 2 > (ji, 72, mi, m 2 \j,m). (7.55) 

7711,7712 
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The amplitudes (ji,j2,m\,m2\j,m) are called Clebsch-Gordan coefficients, and we will 
now present a general procedure for calculating them. 

Let's first consider a simple example of two spin-| systems, such as two electrons. The 
spin basis for each electron is given by \ \,\) = It) and ||, - ^) = ID- The spin basis for the 
two electrons is therefore 

lji,j2,m 1 ,m 2 >G{|t,t>,lt,|>,||,t>,||,|>}. (7.56) 

The total spin is given by j ' = \ + \ = 1 and j = \ - \ = 0, so the four basis states for total 
angular momentum are 

\j,m) €{\1,1), |1,0), |1,-1), |0,0)}. (7.57) 

The latter state is the eigenstate for j = 0. The maximum total angular momentum state 
|1, 1) can occur only when the two electron spins a parallel, and we therefore have 

H,i> = IT,T> = li,|)®li»5>- (7 - 58) 

To find the expansion of the other total angular momentum eigenstates in terms of spin 
eigenstates we employ the following trick: use that J+ = J\+ + J2+. We can then apply J+ 
to the state |1,1), and Ji+ + J2+ to the state |^,^)® |^>^)- This yields 

J_|l,l> = ^ v / j'0'+l)-m(m-l)|l,0> = ^V2|l,0> . (7.59) 
Similarly, we calculate 



J 1 _|i,i) = ^ v /i(|)-i(-i)|i,-i> = ^|i,-i>, (7.60) 
and a similar result for J2-. Therefore, we find that 

W2|l,0> = ftlU> + »IU> => H,0>= IT,i> tJ i,T> . (7.61) 

V2 

Applying J_ again yields 

|1,-1) = ||,|). (7.62) 

This agrees with the construction of adding parallel spins. The three total angular mo- 
mentum states 

|l,D = lt,t), 
|l,0) = ^(|t,|) + ||,t)), 

|1,-1) = ||,|) (7.63) 
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form a so-called triplet of states with j = 1. We now have to find the final state correspond- 
ing to j = 0, m = 0. The easiest way to find it at this point is to require orthonormality of 
the four basis states, and this gives us the singlet state 

|0,0> = ^(|U>-|U». (7.64) 

The singlet state has zero total angular momentum, and it is therefore invariant under 
rotations. 

In general, this procedure of finding the Clebsch-Gordan coefficients results in mul- 
tiplets of constant j. In the case of two spins, we have a tensor product of two two- 
dimensional spaces, which are decomposed in two subspaces of dimension 3 (the triplet) 
and 1 (the singlet), respectively. We write this symbolically as 

2®2 = 3©1. (7.65) 

If we had combined a spin 1 particle with a spin ^ particle, the largest multiplet would 
have been due to j = 1 + 1 = |, which is a 4-dimensional subspace, and the smallest sub- 
space is due to j ' = 1 - \ = h, which is a two-dimensional subspace: 

3®2 = 4©2. (7.66) 

In general, the total angular momentum of two systems with angular momentum k and I 
is decomposed into multiplets according to the following rule (k>l): 

(2fc + l)® (21 + 1) = [2(fc + + l]©[2(fc + -l]©...©[2(fc-0 + l], (7.67) 

or in terms of the dimensions of the subspaces (n > m): 

n®m = (n + m — 1)© (n + m — 3)© ... © (n — m + 1). (7.68) 

Exercises 

1. Angular momentum algebra. 

(a) Prove the algebra given in Eq. (7.2). Also show that [L 2 ,LJ = 0, and verify the 
commutation relations in Eq. (7.7). 

(b) Show that L 2 \l,m) = 1(1 + l)h 2 \l,m). Use the fact that [L_,L 2 ] = 0. 

2. Pauli matrices. 

(a) Check that the matrix representation of the spin-| operators obey the commu- 
tation relations. 

(b) Calculate the matrix representation of the Pauli matrices for s = 1. 

(c) Prove that exp[-i# • cr] is a 2 x 2 unitary matrix. 
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3. Isospin I describes certain particle families called multiplets, and the components of 
the isospin obey the commutation relations Ui,Ij] = ieijjj.%. 

(a) What is the relation between spin and isospin? 

(b) Organize the nucleons (proton and neutron), the pions (n + , n°, and n~), and the 
delta baryons (A ++ , A + , A , and A") into multiplets. You will have to determine 
their isospin quantum number. 

(c) Give all possible decay channels of the delta baryons into pions and nucleons 
(use charge and baryon number conservation). 

(d) Calculate the relative decay ratios of A + and A into the different channels. 

4. A simple atom has orbital and spin angular momentum, and the Hamiltonian for 
the atom contains a spin-orbit coupling term H so = gh L • S, where gh is the coupling 
strength. 

(a) Are orbital and spin angular momentum good quantum numbers for this sys- 
tem? What about total angular momentum? 

(b) Use first-order perturbation theory to calculate the energy shift due to the spin- 
orbit coupling term. 

(c) Calculate the transition matrix elements of H so in the basis {\l,m;s,m s )}. 

5. Multiplets. 

(a) A spin | particle and a spin 2 particle form a composite system. How many 
multiplets are there, and what is the dimension of the largest multiplet? 

(b) How many multiplets do two systems with equal angular momentum have? 
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8 Identical Particles 

We have so far looked at the quantum mechanical description of a few particles with spin 
in the previous section, and particles that exhibit entanglement in section 5. In all these 
cases, we assumed that the individual particles could be distinguished from each other. 
For example, the two-electron state (It I) - llt))/ 1 /^ assumes that we have two electrons, 
one held "over here", and the other "over there", and we can talk meaningfully about their 
respective spins. The tensor product structure of our Hilbert space is a manifestation of 
our ability to label our particles unambiguously. 

However, what happens when we place the two electrons inside a sealed box? The wave 
functions of the electrons will quickly start to overlap. Since the electrons are identical 
particles, which according to basic quantum mechanics do not have well-defined paths, we 
cannot keep track of which electron is which inside the box. Not even in principle. 

8.1 Symmetric and anti-symmetric states 

The indistinguishability of identical particles means that we have to adjust our quantum 
mechanical description of these objects. There are two ways of doing this, namely via 
a modification of the allowed states and via a restructuring of the observables 6 . In this 
section we consider the restricted state space, and in the next we will be considering the 
new observables. 

First of all, since the total number of particles is an observable quantity (for example 
by measuring the total charge in the box), we can give the particles an artificial labelling. 
The wave functions of the two particles are then given by ^(rij)-^ for particle 1 at position 

and |0(r 2 )) 2 for particle 2 at position r 2 . Since we can swap the positions of the particle 
without observable consequences, we find that there are two states that denote the same 
physical situation: 

|i//(ri),0(r 2 )) 12 and |^(r 2 ),0(ri)) 12 . (8.1) 

However, we wish that each physically distinct situation has exactly one quantum state. 
Since there is no preference for either state, we can denote the physical situation of identi- 
cal particles at position ri and r 2 by the quantum state that is an equal weight over these 
two possibilities: 

k(ri),0(r 2 )> 12 + e^|^(r 2 ),0(ri)> 12 
l x P(ri,r 2 )>i2 = - 12 ' — . (8.2) 

You can verify that swapping the positions ri and r 2 of the indistinguishable particles 
incurs only a global (unobservable) phase. The question is now how we should choose (p. 

Suppose that the two identical particles in the box are electrons. We know From Pauli's 
exclusion principle that the two electrons cannot be in the same state. Therefore, when 



6 This is sometimes called second quantisation. This is a misnomer, since quantisation occurs only once, 
when observables are promoted to operators. 
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(p = y/, the state in Eq. (8.2) should naturally disappear: 



.™, „ Mri),^r 2 )) 12 W»|y(r 2 ),^ri)) 12 

I Y(ri,r2))i2 = ^= = U, 



(8.3) 



which means that for particles obeying Pauli's exclusion principle we must choose e l(p = — 1. 
The quantum state of the two particles is anti-symmetric. 

What about particles that do not obey Pauli's exclusion principle? These must be re- 
stricted to states that are orthogonal to the anti-symmetric states. In other words, they 
must be in states that are symmetric under the exchange of two particles. For the two 
identical particles in a box, we therefore choose the value e ltp = +1, which makes the state 
orthogonal to the anti-symmetric state. The two possibilities for combining two identical 
particles are therefore 



|W s (n,r 2 )> 



I^A(ri,r 2 )> 



|vKri),0(r 2 )> + |v(r 2 ),0(ri)) 
|tp(ri),0(r 2 )) - |i//(r 2 ),0(ri)) 



(8.4) 



These states include both the internal degrees of freedom, such as spin, and the external 
degrees of freedom. So two electrons can still be in the state Iff), as long as their spatial 
wave function is anti-symmetric. The particles that are in a symmetric overall quantum 
state are bosons, while the particles in an overall anti-symmetric state are fermions. 

We can extend this to N particles in a fairly straightforward manner. For bosons, we 
sum over all possible permutations of ri to r^v: 



\^ s (r r N )) 



1 



E |^i(ri),...,^Ar(riv)> 

• perm(ri,...,r?v) 



(8.5) 



For fermions, the odd permutations pick up a relative minus sign: 

|¥ A (ri,...,rjv)) = £ ki(ri),...,i^iv(riv)>--^Ek^ r i)'---'^(riv)>. (8.6) 

even V N\ odd 

This can be written compactly as the so-called Slater determinant 



¥ A (ri,...,rjv) : 



1 



Vi(ri) y/i(r 2 ) ... ^i(rjv) 
V2(ri) i^ 2 (r 2 ) ... ^ 2 (rjv) 

^iv(ri) Vivfo) ••• V N (r N ) 



(8.7) 



where we removed the kets for notational convenience. The N particles in the state 
I^A^ij- • • > r iv)) automatically obey the Pauli exclusion principle. 
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8.2 Creation and annihilation operators 

The second, and particularly powerful way to implement the description of identical parti- 
cles is via creation and annihilation operators. To see how this description arises, consider 
some single-particle Hermitian operator A with eigenvalues aj. On physical grounds, and 
regardless of distinguishability we require that nj particles in the eigenstate \aj) of A 
must have a total physical value rtj x aj for the observable A. We can repeat this for all j, 
and obtain a potentially infinite set of basis vectors 

\n 1 ,n2,n 3 ,...) , 

for all integer values of nj, including zero. The spectrum of A can be bounded or un- 
bounded, and discrete or continuous. It may even be degenerate. For simplicity, we con- 
sider here an unbounded, non-degenerate discrete spectrum. 
A special state is given by 

|0) = |0,0,0,...>, (8.8) 

which indicates the state of no particles, or the vacuum. The numbers nj are called the 
occupation number, and any physical state can be written as a superposition of these 
states: 

oo 

c «l,n 2 ,n 3 ,... 1^1,712,713,...). (8.9) 

ni,»2,»3,-=0 

The basis states \n\, 712,713,...) span a linear vector space called aFock space & . It is the 
direct sum of the Hilbert spaces for zero particles J€q, one particle J6\, two particles, etc.: 

& = je$® je x ® je 2 ® je z ® ■ ■ ■ (8.10) 

Since | *¥) is now a superposition over different particle numbers, we require operators 
that change the particle number. These are the creation and annihilation operators, d^ 
and d respectively. Up to a proportionality constant that we will determine later, the 
action of these operators is defined by 

d f j \n 1 ,n 2 ,...,n j ,...)(x \n 1 ,n 2 ,...,n j + l,...) , 

dj\ni,n 2 ,...,nj,...)oc \n 1 ,n 2 ,...,n j - 1,...) . (8.11) 

These operators are each others' Hermitian adjoint, since removing a particle is the time 
reversal of adding a particle. Clearly, when an annihilation operator attempts to remove 
particles that are not there, the result must be zero: 

d j \n 1 ,n 2 ,...,n j = 0,...) = 0. (8.12) 

The vacuum is then defined as the state that gives zero when acted on by any annihilation 
operator: dj\0) = for any j. Notice how we have so far sidestepped the problem of 
particle swapping; we exclusively used aspects of the total particle number. 
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So far, we have considered only the basis states of many particles for a single observ- 
able A. What about other observables, in particular those that do not commute with A? 
We can make a similar construction. Suppose an observable B has eigenvalues bj. We can 
construct creation and annihilation operators b^ and bj that act according to 

bj\mi,m,2,...,mj,...)<x \mi,m2,...,mj + l,...) , 
bj\mi,m2,...,mj,...)(x |mi,m2,...,m/-l,...) . (8.13) 

where mj is the number of particles with value bj. Typically, the basis states of two 
observables are related via a single unitary transformation \bj) = U\a,j) for all j. How 
does this relate the creation and annihilation operators? 

To answer this, let's look at the single particle states. We can write the single-particle 
eigenstates \aj) and \ bj) as 

|a/) = dt|0> and |&,) = &t|0>. (8.14) 

We assume that U does not change the vacuum 7 , so U\0) = |0). This means that we can 
relate the two eigenstates via 

\bj) = U \aj) = UaX\0) = Ua] itfu) |0> = Ud]U^ |0> = St |0> , (8.15) 

where we have strategically inserted the identity U*U. This leads to the operator trans- 
formation 

P. = UaXu f . (8.16) 
j j 

The Hermitian adjoint is easily calculated as bj = UdU\ It is left as an exercise for you to 
prove that 

bj = Y, u jk d i and bj = Y J U* kj a k , (8.17) 
k k 

where u jk = (ak\bj). 

What are the basic properties of the creation and annihilation operators? In particular, 
we are interested in their commutation relations. We will now derive these properties 
from what we have determined so far. First, note that we can create two particles with 
eigenvalues b i and bj in the system in any order, and the only difference this can make is 
in the normalisation of the state: 

b\b]m = Xpfilm , (8.18) 



7 This is a natural assumption when we are confined to the single particle Hilbert space, but there are 
general unitary transformations for which this does not hold. 
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where A is some complex number. Now let's look at the same process in the basis of the 
observable A. We have 

(b]b) - AfiJfiJ) m=Y,u ik Uji [d\d\ - Xd}d\] m = 0. (8.19) 
kl 

Since the Uik and Uji are the matrix elements of an arbitrary unitary transformation, and 
the state is certainly not zero, we require that 

d\d\-Xd\d\ = 0. (8.20) 

Since k and I are just dummy variables, we equally have 

d\d\-Xd\d\ = Q. (8.21) 

We now substitute Eq. (8.21) into Eq. (8.20) to eliminate djdt- This leads to 

(1-A 2 )djd} = 0, (8.22) 

and therefore 

A = ±l. (8.23) 

The relation between different creation operators can thus take two forms. They can obey 
a commutation relation when A = +1: 



d \ d \- & l d \ 



44 



0, (8.24) 
or they can obey an anti-commutation relation when A = -1: 

d\d\ +44 = {a],a\} = 0, (8.25) 

While creating the particles in different temporal order is not the same as swapping two 
particles, it should not come as a surprise that there are two possible situations (the com- 
mutation relation and the anti-commutation relation). We encountered two possibilities 
in our previous approach as well, where we found that many-particle states are either 
symmetric or anti-symmetric. In fact, creation operators that obey the commutation rela- 
tion produce symmetric states, while creation operators that obey the anti-commutation 
relation produce anti-symmetric states. We also see that the creation operators described 
by the anti-commutation relations naturally obey Pauli's exclusion principle. Suppose 
that we wish to create two identical particles in the same eigenstate \aj). The anti- 
commutation relations say that {dj,dj} = 0, so 

df = 0. (8.26) 

Any higher powers of dt will also be zero, and we can create at most one particle in the 
state \aj). 
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Taking the adjoint of the commutation relations for the creation operators gives us the 
corresponding relations for the annihilation operators 



didk-dk&i = [di,dk] = 0, 



(8.27) 



or 



didk+dkdi = {di,dk} = 0. 



(8.28) 



The remaining question is now what the (anti-) commutation relations are for products of 
creation and annihilation operators. 

We proceed along similar lines as before. Consider the operators bi and St with j ^ k, 
and apply them in different orders to a state | . 



(8.29) 



However, now we have to be a little bit careful about the case where we create and anni- 
hilate a particle in the same eigenstate, so we first consider the case where i ^ j. In terms 
of the observable A, this becomes 



[bib] -nb]bi) m = £ u * ki uji [& k &\ - n&\&k) m = o, 



(8.30) 



and the same argumentation as before leads to ji = ±1. For different k and I we therefore 
find 



dk,d\ 



or |dfe,dj| = 0. 
Let's consider the special case where - \0). We find 



[bib] -Lib)bi) |0) = bS) |0) = Sij |0) . 
We want to know what effect this has on the case where I = k in Eq. (8.30): 
\bib\-nb\b^\0) = Y. u li u Jk \dkd\ - fxd\di^\0) = 8ij\0) . 



(8.31) 



(8.32) 



(8.33) 



Since Y.k u \i u jk is just the component form of U^U, we find that Y.k u \i u jk = 8ij and the 
term in brackets must be equal to 1. This gives us our final set of relations: 



dk,d\ 



= 1 



or 



1. 



(8.34) 



To summarise, we have two sets of algebras for the creation and annihilation operators. 
The algebra in terms of the commutation relations is given by 



[dk,dl] 







and 



dk,d\ 



■■Ski- 



(8.35) 
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This algebra describes particles that obey Bose-Einstein statistics, or bosons. The algebra 
in terms of anti-commutation relations is given by 

{d k ,dl} = \&\,&\^ = and \d k ,d\\ = 8 k i. (8.36) 

This algebra describes particles that obey Fermi-Dirac statistics, or fermions. 

Finally, we have to determine the constant of proportionality for the creation and anni- 
hilation operators. We have already required that 6;St |0) = Sij |0). To determine the rest, 
we consider a new observable that gives us the total number of particles in the system. We 
denote this observable by n, and we see that it must be additive over all particle numbers 
for the different eigenvalues of A: 

rc = X>,-, (8.37) 

j 

where nj is the number of particles in the eigenstate a/). The total number of particles 
does not change if we consider a different observable (although the distribution typically 
will), so this relation is also true when we count the particles in the states \bj). Pretty 
much the only way we can achieve this is to choose 

n = Lnj = Ld]d j = Y^b]b j . (8.38) 

j j j 

For the case of rij particles in state \a,j) this gives 

^j\n j ) = n j \n j ), (8.39) 

where we ignored the particles in other states |a&) with k ^ j for brevity. For the Bose- 
Einstein case this leads to the relations 

& j\ n j) = V K j\ n J~ 1 ) and & )\ n j) = \J n j + 1 \ n j + 1 ) ■ ( 8 - 4 °) 

For Fermi-Dirac statistics, the action of the creation and annihilation operators on number 
states becomes 

dj\0)j = and a) |0> y - = e~ ia , 

dj\l)j = e ia \0)j and d]|l> y = 0. (8.41) 
The phase factor e ia can be chosen ±1. 

How do we construct operators using the creation and annihilation operators? Suppose 
that a one-particle observable H has eigenvalues Ej and eigenstates \j). This can be, for 
example the Hamiltonian of the system, which ensures that the physical values of the 
particles (the eigenvalues) are additive. The operator for many identical particles then 
becomes 



H = Y.E j n j = Y.E j d)d j , 

j j 



(8.42) 
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which transforms according to Eq. (8.17). More generally, the operator may not be written 
in the eigenbasis | re i, re 2,...), in which case it has the form 

H = L H ijb]bj, (8.43) 

ij 

where Hij are matrix elements. The creation and annihilation operators aX and dj diago- 
nalise H, and are sometimes called normal modes. The reason for this is that the creation 
and annihilation operators for bosons obey the same mathematical rules as the raising 
and lowering operators for the harmonic oscillator. The index j then denotes different os- 
cillators. A system of coupled oscillators can be decomposed into normal modes, which are 
themselves isolated harmonic oscillators. 



8.3 The spin-statistics theorem 

We have found that there are two types of identical particles in quantum mechanics, 
namely bosons and fermions. We derived this in two ways, via the symmetry of the wave 
function, and via the operator algebras of the creation and annihilation operators. You 
undoubtedly know that bosons always have integer spin, while fermions always have half- 
integer spin. But we have not talked about the spin of the particles in the derivations, 
so why exactly is there a relationship between the spin of the particles and the statis- 
tics (Bose-Einstein or Fermi-Dirac) that they obey? This connection is given by the spin- 
statistics theorem. 

Formal derivations of the spin-statistics theorem typically involves relativistic quan- 
tum field theory, but there are heuristic arguments that give a feel for its origin. Consider 
two identical particles with spin s. The eigenstates for the 2 -component y/ S m(r,%) can be 
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written as 

V.sn(T,x) = e im *Vs(T), (8.44) 

where r is the position of the particle and % is the parameter that produces the global 
phase factor ±1 upon a rotation over 2n depending on the spin of the particle. We can 
now consider the exchange of two identical particles at i*i and r2 as a rotation around a 
common axis (see Fig. 4). The rotation then becomes 

= e 2^(i) (r2) ^(2) (ri) . (8 .45) 

You see that for half-integer spin the global phase factor is -1, leading to anti-symmetric 
quantum states. Note that % is not a spatial angle, but a parameter in spin space. 



8.4 Bose-Einstein and Fermi-Dirac statistics 



Finally, in this section we will derive the Bose-Einstein and Fermi-Dirac statistics. In par- 
ticular, we are interested in the thermal equilibrium for a large number of (non-interacting) 
identical particles with some energy spectrum Ej, which my be continuous. 

Since the number of particles is not fixed, we are dealing with the Grand Canonical 
Ensemble. Its partition function S is given by 



S = Tr 



(8.46) 



where H is the many-body Hamiltonian, (5 = 1/£#T and /i is the chemical potential. The 
average number of particles with single particle energy E j is then given by 



ldlnH 



P dE :i 



(8.47) 



For the simple case where H = Y.jEjAj and the creation and annihilation operators obey 
the commutator algebra, the exponent can be written as 



exp 

and the trace becomes 



E 



\ n j)( n j\ 



(8.48) 



j nj=0 



1 

I _ e (Kn-Ej) 



(8.49) 



The average photon number for energy E , is 



1 dlnE 

{n j ) = -p-dE- 



1 dZ 



pZdEj e -^-Ej)_i 



(8.50) 
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(nj) („.) 




Figure 5: Left: Bose-Einstein distribution for different temperatures (ju = 0). The lower 
the temperature, the more particles occupy the low energy states. Right: Fermi-Dirac dis- 
tribution for different temperatures and jU = 1. The fermions will not occupy energy states 
with numbers higher than 1, and therefore higher energies are necessarily populated. The 
energy values Ej form a continuum on the horizontal axis. 



This is the Bose-Einstein distribution for particles with energy Ej. It is shown for increas- 
ing Ej in Fig. 5 on the left. 

Alternatively, if the creation and annihilation operators obey the anti-commutation 
relations, the sum over rij in Eq. (8.48) runs not from to oo, but over and 1. The 
partition function of the grand canonical ensemble then becomes 

j 

and the average number of particles with energy Ej becomes 

This is the Fermi-Dirac statistics for these particles, and it is shown in Fig. 5 on the right. 
The chemical potential is the highest occupied energy at zero temperature, and in solid 
state physics this is called the Fermi level. Note the sign difference in the denominator 
with respect to the Bose-Einstein statistics. 

Exercises 

1. Calculate the Slater determinant for three electrons and show that no two electrons 
can be in the same state. 
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2. Particle statistics. 

(a) What is the probability of finding n bosons with energy Ej in a thermal state? 

(b) What is the probability of finding n fermions with energy Ej in a thermal state? 

3. Consider a system of (non-interacting) identical bosons with a discrete energy spec- 
trum and a ground state energy E . Furthermore, the chemical potential starts out 
lower than the ground state energy /j,<Eq. 

(a) Calculate (no) and increase the chemical potential to p — ► Eq (e.g., by lowering 
the temperature). What happens when p passes Eq? 

(b) What is the behaviour of (^thermal) = ( n j} as A 4 ~~ * ^o? Sketch both (no) and 
(^thermal) as a function of p. What is the fraction of particles in the ground state 
at p = £ ? 

(c) What physical process does this describe? 

4. The process U = exp(r d\d\ - r* 0,16,2) with r eC creates particles in two systems, 1 
and 2, when applied to the vacuum state I^F) = U\0). 

(a) Show that the bosonic operators d\d\ and d\d^ obey the algebra 

[K_,K + ] = 2K and [K ,K ± ] = ±K ± , 

withi<: + =Kl. 

(b) For operators obeying the algebra in (a) we can write 



e r* + -r-i_ =exp 



— tanhlrl^ 
\r\ 



exp 



-21n(cosh|r|) J R:o 



exp 



- — tanh|r|X"_ 
\r\ 



Calculate the state \*¥) of the two systems. 

(c) The amount of entanglement between two systems can be measured by the 
entropy S(r) of the reduced density matrix pi = Tritp] for one of the systems. 
Calculate S(r) = -Trfpilnpi]. 

(d) What is the probability of finding n particles in system 1? Compare this to your 
findings in question 2(a). 
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9 Many-Body Problems in Quantum Mechanics 

In this final section we study a variety of problems in many-body quantum mechanics. 
First, we introduce the Hartree-Fock method for taking into account the effect of electron- 
electron interactions in atoms. Next, we describe spin waves in magnetic materials using 
the Heisenberg model. Third, we describe the behaviour of an atom interacting with pho- 
tons in a cavity, and introduce the Jaynes-Cummings Hamiltonian. And finally, we take a 
brief look at the basic ideas behind quantum field theory. 

9.1 Interacting electrons in atomic shells 

Previously, you have encountered the Schrodinger equation for a particle in a central po- 
tential, which can be interpreted as an electron bound to a proton in a hydrogen atom. 
We found that the energy levels are quantised with quantum number n. In addition, the 
spin and orbital angular momentum is quantised with quantum numbers m s , I, and m, 
respectively (we assume that s = \ since we are considering electrons). We can denote the 
set of quantum numbers n, m s , I, and m by greek indices a, fi,..., and the wave-functions 
u a (r) then form a complete orthonormal basis for the bound electron. 

It is tempting to keep this complete orthonormal basis for other atoms as well, and 
assume that the ground state of an iV-electron atom is the tensor product of the iV lowest 
energy eigenstates, appropriately anti-symmetrized via the Slater determinant. Indeed, 
the periodic table is based on this assumption. However, this ignores the fact that the 
electrons interact with each other, and the ground state of a many-electron atom is differ- 
ent. The Hartree-Fock method is designed to take this into account. It is a constrained 
variational approach, in which the trial state that is optimised over is forced to be a Slater 
determinant in order to keep the correct particle statistics. In this section we present the 
Hartree-Fock method, and arrive at the Hartree-Fock equations, which can be solved iter- 
atively. We follow the derivation given by Bransden and Joachain (Physics of Atoms and 
Molecules, 1983 pp. 320-339). 

First, we specify the Hamiltonian. Using the notation r; = for the distance of the 
i th electron from the nucleus, and r^- = |rj -r,| for the distance between electrons i and j, 
we find that 



where we used units in which e/4neo = 1 and the Hamiltonian is divided into the single 
electron Hamiltonian (Hi) and the inter-electron Hamiltonian (H 2 ). We choose as a nor- 
malised trial quantum state ^(ri, . . . ,r^) the Slater determinant 




(9.1) 



W(r 1 ,...,r N ) = 



1 



u a (ri) 
u a (r 2 ) 



up(ri) 
up(r 2 ) 



u v (ri) 
u v (r 2 ) 



(9.2) 



U a (rN) 



up(r N ) 



u v (r N ) 
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and calculate the expectation value of the Hamiltonian H. This must be larger or equal to 
the ground state Eq 

(y\Hm>E , (9.3) 

and varying the trial state then allows us to minimise the expectation value. This will get 
us close to the ground state energy. 

Since the Slater determinant is a rather large expression, it will save us quite a bit of 
writing if we introduce the anti-symmetrisation operator si, such that 

W(n, ...,r N )= VW. *fua(ri)up(r 2 ) . . . u v (r N ) = VW. ^<5 H , (9.4) 

where $n(ri, . . . ,rjv) is the Hartree wave function. The operator st can then be written as 
a sum over permutations P of the labels a, ft, . . . v: 

and st is a projection operator: si 2 = si = si*. Both Hi and i?2 commute with si. 

Next, we calculate the expectation values (WIH^) and ( x ¥\H 2 \ y i'). Since [H h si] = 0, 
we can write 

(WIHilY) = N\ (OhI^-H'i^I^h) = N\ (OhI-H'i^I^h) = Nl (® H \Hisi\(!> H ) ■ (9.6) 

A permutation of the labels a, /3, . . . v leads to an orthonormal state and if i is a sum over 
one-electron Hamiltonians. We can therefore write this as 

N 

{W\Hi\W)=Nl <O h |Hi^|$h> = £(-l) P <<t> H |#iP|<I>H> = £ <O h |A;|<J>h> 

P i = l 

= £>„(ri)|&i|u a (ri)> = £/ a (9.7) 

a a 

where hi is the single electron Hamiltonian 

* ft q Z 

h ' = -i^'-n (9 ' 8> 

Next, we calculate the expectation value ( v P|if2l v f / ) of the two-electron interaction 
Hamiltonians. Using the same reasoning as in Eq. (9.6), we find that 

C¥\H 2 m = Nl <<D h I#2^I<J>h> • (9-9) 
Substituting the explicit form of si, we find 

> \ / 1-P \ 

1 $h>, (9.10) 



0JW 2 m = XX(-l) p (<t> H — Oh} = E( <I) h 



i<j p 
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where Pij is the exchange operator of electrons i and j. This expression allows us to write 



(W\H 2 m = W\(u a (r i )up(r j 



u a (ri)up(rj) )-( u a (ri)up(rj) 



up(ri)u a (rj) 



(9.11) 



Note the swap of a and /3 in the last ket. This expectation value consists of two terms, 
namely the direct term 



J a p = { u a (ri)up(rj) 



1 



and the exchange term 



K a p = ( u a {ri)up{rj) 



u a (ri)up(rj)J , 



up{ri)u a {rj) 



The total expectation value therefore becomes 



(W\Hm = J^I a + -^{J afj -K afi ) 



(9.12) 



(9.13) 



(9.14) 



The matrix elements J a p and K a p are real and symmetric in a and /3. 

The second step towards the Hartree-Fock method is to find the minimum of E = 
(^Flif I^F) by varying the u a (r;). This means finding SE = 0. However, we must keep the 
functions u a (r;) orthonormal to each other, and this imposes iV 2 constraints. We can incor- 
porate these constraints in the variational procedure by including Lagrange multipliers 
e aj 8, and the variational equation becomes 



6E-J^e a p6(u a (r)\u^r)) = 0. 



(9.15) 



There is no explicit reference to electron positions r; in (u a (r)|u j g(r)) since we are only in- 
terested in its orthonormality properties. The Lagrange multipliers e a /5 form the elements 
of a Hermitian matrix. 

The variational approach ultimately leads to a set of iV coupled equations: 

E a u a (r;) = hiU a (ri) + Y,(up(rj)\rJj |u /6 (rj))u a (ri)-^<u i3 (r 7 )|rT ; 1 |u a (ry)) u^rt), (9.16) 

P P 

which are known as the Hartree-Fock equations. These can be solved by iteration up to 
any desired precision. 



9.2 Spin waves in solids 

Consider a system of spins with a nearest-neighbour interaction. For a uniform interaction 
in all direction, this is described by the Hamiltonian 

H = ± J £ S, • S 7 = ± J £ S z ,iS zJ + \ (S +!i S-j + S-,iS +J ) . (9.17) 

(ij) (U) Z 
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where J > is the coupling strength between the spins, E(j,y) is the sum over all neigh- 
bouring pairs, and S+=S X ± iS y . The physics described by this Hamiltonian is known as 
the Heisenberg model. The sign of the coupling (here made explicit) determines whether 
the spins want to lign up in parallel (— J) or antiparallel (+J). The former situation de- 
scribes ferromagnets, while the later describes anti-ferromagnets. The spin operators for 
different sites (i ^ j) commute with each other, while the spin operators at the same site 
(i = j) obey the spin algebra of Eq. (7.24). 

Both systems have a well-defined ground state. For the ferromagnet this is the tensor 
product of the ground state of each individual spin. We are interested in the behaviour 
of the excitations with respect to this ground state. Due to the large degeneracy in the 
system (all the spins are of the same species with the same coupling J) the excitations 
act as identical quasi-particles. Consequently we can describe them using creation and 
annihilation operators. It turns out that they behave like bosons. You can think of an 
excitation as a higher spin value at some site that propagates to its neighbours due to the 
interaction. This is called a spin wave. 

Suppose that the spins are aligned in the positive 2-direction (so we consider -J), 
and S z has the maximum eigenvalue s. When the spin is lowered by h, this creates an 
excitation in the system, because the spin is no longer lined up. So the 2-component of the 
spin at site j is given by 

S zJ = s-d]dj, (9.18) 



where d -dj is the operator for the number of excitations at site j. Since S+ raise and 



j 

lower the eigenvalue of S z , we expect that S+ oc d and S- cx d^. When we insist on the 
commutation relation = 2S Z , they become 



S+j = (2s-dXdj\ 2 dj and S-j = (is-atajY dt. (9.19) 



This is known as the Holstein-Primakoff transformation. 

For small numbers, the operators S+ can be approximated as 

t 



S +J ^V2sdj and S-j^V2sd]. (9.20) 



This allows us to write the Heisenberg Hamiltonian of Eq. (9.17) with - J to lowest order 
as 



H = - J £ s 2 + s [d^dj + djdt -dtdj -d^.d 7 ) 



(9.21) 



For a simple cubic lattice of side L, lattice constant a and total number of spins iV = (L/a) 3 
we expect the spin waves to have wave vectors 

2tt 

k=— (m,n,o) with m,n,o£N, (9.22) 
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and 1 < m,n,o < L. The spin sites must now be indicated by a vector r instead of a single 
number j, and the Fourier transformation of d\. and d r is given by 



1 



a r =^£>- /k - r d k and 4= -=E eik " r "k' (9.23) 



/NT VN 
which transforms the Heisenberg Hamiltonian to 

H = -BJs 2 N - ^ E E ^ r(k " k,) (e iA (l - l) d k d(k') 

N r,dk,k' 

= -3 Js 2 N - ^ Y.(V)e(k) d k d k , (9.24) 

where r is the position of a lattice site, d is the vector from a site to its nearest neighbours, 
which takes care of the sum over nearest neighbours. This is a diagonal Hamiltonian with 
eigen-energies 

e(k) = 2 Js (3 - cosk x a - cos^a - cosk z a). (9.25) 

This is the dispersion relation for the spin waves, and to lowest order (cosx - 1 - \x 2 ) it is 
quadratic: 

e(k) = Jsa 2 k 2 . (9.26) 

Spin waves are important when we want to manipulate magnetic properties with high fre- 
quency, such as in microwave devices. They carry energy, and are therefore a mechanism 
for dissipation. 

For the case of anti-ferromagnets (+J), the ground state is harder to find. Consider an 
anti-ferromagnet that is again a simple cubic lattice with alternating spin ±s and lattice 
constant a. We can think of this lattice as two sub-lattices with constant spin, and redefine 
the spins on the -s sub-lattice according to 

S x ^-S x , Sy^Sy, and S z ^-S z . (9.27) 

These operators still obey the commutation relations of spin (which S — -S would not), 
and the Heisenberg Hamiltonian becomes 

H = - ± J E S 2 ,iS zJ + \ {S +!i S +J + S-,iS-j) . (9.28) 

When we apply the Holstein-Primakoff transformation to this Hamiltonian, to first order 
we obtain 

H = - J H \s 2 + s[d\di + b]bj + &ibj + a\b^\, (9.29) 
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where d\ and d; are the creation and annihilation operators for the spin +s sub-lattice, and 
St and bj are the creation and annihilation operators for the original spin -s sub-lattice. 
After the Fourier transform of the creation and annihilation operators we get 



H = -3Js 2 N + 3Js £ [d k d k + &t k 6- k + /Xk) (d k 6-k + d k &t k ) 



(9.30) 



where /"(k) = ^(cosk x a + cos^ y a + cos^a). 

To find the ground state we must diagonalise H so that it is a sum over number op- 
erators. This will involve mixing creation and annihilation operators. This is a unitary 
transformation that can be written as 

dk = "kCk-^kdl k and &_ k = « k d_ k -D k c[. (9.31) 

This leads to the Hamiltonian 

H = -3 J s(s + 1)N + £>(k) (c k c k + d! k d- k + l) , (9.32) 

k 

with the spin wave energy 

e(k) = 3Js (l-/(k) 2 )^ . (9.33) 

For small k the dispersion relation of the spin wave is linear in the wave vector, e(k) = 
\/3Jsak, which means that the spin waves behave markedly different in ferromagnets 
and anti-ferromagnets. 



9.3 An atom in a cavity 

Consider a two-level atom with bare energy eigenstates \g) and |e> and energy splitting 
H(x)q. The free Hamiltonian H of the atom is given by 

H = ^too(J ^j. (9.34) 

The atom interacts with an electromagnetic wave, and the interaction is approximately 
the coupling between the dipole moment d associated with the \g) <-► \e) transition and the 
electric field E, leading to the interaction Hamiltonian 

H int = -d-E, (9.35) 

where d = -er, and E is the classical, complex-valued electric field at the position of the 
atom. For an atom at the position r = 0, the electric field can be written as 



E = <g ee ia)t + g e*e- 



(9.36) 
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where Sq is the real amplitude of the electric field e is the polarsation vector of the wave. 
The off-diagonal matrix elements of Hint are given by 

(e\H int \g) = eS {e\r\g) • e e iwt + c.c, (9.37) 

and c.c. denotes the complex conjugate. Since r has odd parity, the diagonal matrix ele- 
ments (g\Hi nt \g) and (e\Hi nt \e) vanish. When we define r eg = (e\r\g), the total Hamilto- 
nian becomes 



H 



\hm eSor* eg -(ee i ^ + e*e-^ t ), 

e<g r eg -(ee iwt + e*e- ia)t ) -\ho) 



Using the Rotating Wave Approximation (see exercise 9.1), this Hamiltonian can be writ- 
ten as 



tl — — _ 

2 n -i 



(9.39) 



where we made the substitution 



2e<gb 

v = a»-wo and 0= r eg -e. (9.40) 

h 

We can use the standard matrix techniques in quantum mechanics to solve for the eigen- 
values, the eigenstates, and the time evolution of the atom. 

Next, we consider the situation where atom is placed inside a cavity of volume V, and 
the electric field in the cavity has angular frequency co with wave vector k propagating in 
the z -direction. Assume that the length of the cavity is a multiple of A/2, such that co is a 
resonant cavity mode. The field is very weak, so that the classical description of E is no 
longer sufficient. In particular, the field is made of photons, i.e., identical bosons. Conse- 
quently, we need to express E in terms of bosonic creation and annihilation operators d^ 
and d, which create photons of frequency 10. Since the intensity of the field is proportional 
to both and d^d, we expect the electric field to be proportional to the creation and an- 
nihilation operators themselves. Hermiticity requires that it is proportional to the sum 
of the creation and annihilation operators. Furthermore, the electric field is a transverse 
standing wave cavity mode and must vanish at the mirrors due to the boundary condi- 
tions imposed by Maxwell's equations. The spatial amplitude variation therefore includes 
a factor sinkz. For linear polarisation the operator form of E then becomes 



E(z, t) = e y (d e~ iwt + a* e ia)t ] sinkz , (9.41) 

where we assumed that e is real 8 . The creation and annihilation operators are thus the 
amplitude operators of the field. Note that by the analogy with the harmonic oscillator, 



This is true for linear polarisation. For elliptical polarisation e will be complex. The subsequent deriva- 
tion will be slightly modified (with more terms in Hi nt ), but no extra technical or conceptual difficulties 
arise. 
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the electric field operator acts as a position operator of a particle in a harmonic potential 
well characterised by a>. 

We again consider the dipole approximation of the atom in the field, and the Hamilto- 
nian is written as 



H int = -d-E = er-e^ — sinkz (d e~ iwt + d f e iwt } . (9.42) 
The operator r can be written as 

r = r eg \e)(g\+r* eg \g)(e\, (9.43) 
and for notational simplicity, we define the coupling constant g as 



/ ha) 

g = er eg -e^l— — sinkz. (9.44) 

We again calculate the matrix elements of H{ nt as before, but this time we write the oper- 
ator in terms of \g) (e\ and \e) (g\: 

H int = g\e){g\ (ae- iwt + a*e i(0t )+g* \g){e\ [a e~ iwt + a* e ia>t ) . (9.45) 

It is convenient to express \g) (e\ and \e) (g\ in terms of the two-level raising and lowering 
operators cr+ and cr_: 

a+ = \e)(g\ and a- = \g)(e\, (9.46) 

with the commutation relation 

[a + ,(T-] = 2a 3 with a 3 = \g)(g\-\e)(e\. (9.47) 

The interaction Hamiltonian becomes 

H in t = g o+ (d e- iwt + d f e iwt ) + g* a- (d e~ iM + d f e iwt ) . (9.48) 

Including the free Hamiltonian for the field and the two-level atom, this becomes in the 
Rotating Wave Approximation 

Hjc = ^hcoo as + Ha) d^d + ga+d + g* cr_ d^ , (9.49) 

This is the Jaynes-Cummings Hamiltonian for a two-level atom with energy splitting hwo 
interacting with a cavity mode of frequency co. To achieve the strongest coupling g, the 
volume of the cavity should be small, and the atom should sit at an anti-node of the field. 

Quantities are conserved when they commute with the Hamiltonian. We can identify 
two observables that satisfy this requirement, namely the number of electrons 

P e = \g)(g\ + \e)(e\ = K (9.50) 
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and the total number of excitations 

# e =d + d + |e><e|. (9.51) 

This means that the Hamiltonian will not couple states with different total excitations. 

In a real system, the cavity will not be perfect, and the excited state of the atom will 
suffer from spontaneous emission into modes other than the cavity mode. This can be 
modelled by a Lindblad equation for the joint state p of the atom and the cavity mode. 
The Lindblad operator for a leaky cavity is proportional to the annihilation operator d, 
with a constant of proportionality \/k that denotes the leakage rate. The spontaneous 
emission of the atom is modelled by the Lindblad operator sjyo-. The Lindblad equation 
then becomes 

d pi *y 7c 

— = — [Hjc.p] +jO-pu+- -{a+a-,p} + Kdpd j; - -W&,p). (9.52) 

The research field of cavity quantum electrodynamics (or cavity QED) is devoted in a large 
part to solving this equation. 



9.4 Outlook: quantum field theory 

We have surreptitiously introduced the basic elements of non-relativistic quantum field 
theory. Consider again the Heisenberg model, where we described a lattice of spins with 
nearest-neighbour interactions. If we take the limit of the lattice constant a —* we end 
up with a continuum of creation and annihilation operators for each point in space. This 
is a field. 

Traditionally we construct a quantum field theory from harmonic oscillators at each 
point in space. To this end, we characterise a classical harmonic oscillator with mass M 
in terms of its displacement q and velocity q. The equations of motion for the classical 
harmonic oscillator are the Euler-Lagrange equations 



where L is the Lagrangian 



d dL dL 

^7^-^- = °' (9 " 53) 
dt oq oq 



L = ^Mq 2 -^Kq 2 , (9.54) 



and K can be thought of as a spring constant. Substituting this L into Eq. (9.53) yields 
the familiar differential equations for the harmonic oscillator q + VL 2 q = 0, with D, 2 = KIM. 

Next, we arrange iV particles in a one-dimensional lattice of length L and lattice con- 
stant a, where L = Na. Each particle's displacement is coupled to the displacement of it's 
nearest neighbours by a spring with constant K. The equations of motion of this set of 
coupled particles is given by 



q n = D. 2 [(q n +i ~ q n ) + (<?«-! - <?«)] • 



(9.55) 
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We take the limit of a —* and N — ► oo while keeping L = Na fixed. Our variable q n (t) then 
becomes a /ieZ<i u(x, t), and it takes only a few lines of algebra to show that 



u(x, t) = a 2 D. 2 u"(x, t) = v z u"(x, t) 



2, Jii 



with 



limaD, = v . 

a— 



(9.56) 



This is a wave equation, and v is the velocity of the wave. We can generalise this immedi- 
ately to three dimensions: 



d 2 u d 2 u d 2 u 1 d 2 u 
dx 2 dy 2 dz 2 v 2 dt 2 



0. 



(9.57) 



In quantum field theory, we consider only these waves as the field excitations, and ignore 
the underlying particle structure we used to arrive at this result. We have already done 
something similar when we considered the spin waves in the Heisenberg model. Note also 
that the finite speed of propagation of waves means that we can make the theory Lorentz 
invariant when v = c, the speed of light, and Eq. (9.57) becomes u = 0. 

The wave equation is typically derived from a Lagrangian L, or in the case of a field 
theory, the Lagrangian density 5£ . A massless scalar field is described by the Lorentz- 
invariant Lagrangian density 



5£ = 



'd(J)\ 
,dx j 



+ 



k dx ) { dx j 



'd(pY 
,dx j 



(9.58) 



where for technical reasons we redefined (p = u/y/a. The dispersion relation for such a 
field is c k - ft) , with k the wave number and co the frequency of the wave. Similarly, a 
massive field is described by 



(9.59) 



The Euler-Lagrange equation for this Lagrangian density is the so-called Klein-Gordon 
equation 



= 0. 



The mass term leads to a new dispersion relation 

2j-2; 2 j-2 2 , 2 4 n 

c n k - n or + m c = , 

and the group velocity for wave packets is 

c 



do) 
dk 



with 



mc 



(9.60) 



(9.61) 



(9.62) 



For relativistic particles the momentum hk is much larger than the rest mass mc, and 
therefore v g approaches c. 
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We can solve the Klein-Gordon equation formally by writing 

dk 

v / 2(27T)-to k 



0(r,f)= [ (a(k)e ik - I - iat + a*(h)e- ik - r+iwt ), (9.63) 



where ak is the complex amplitude of a wave with wave vector k. The field is essentially 
a superposition of (non-interacting) eigenmodes labelled by k, and we call this a free field. 
We can introduce interactions between the waves in the field by adding higher-order terms 
to 5£ with coupling constants v, A,. . . 

11 1 

<£ = -0^0X^0) - ^ ™ V " ^ 3 " ^ 4 " • • • (9-64) 

The solution to the Klein-Gordon equation is now no longer the correct solution to the new 
equations of motion, but when the interaction is reasonably weak we can use the solutions 
<p(r,t) of the free field as a starting point in a perturbation expansion. 

So far, everything in this section has been a classical treatment. In order to extend the 
theory to quantum mechanics we have to quantise the field. We achieve this by promoting 
the amplitudes in <p (and therefore <p itself) to operators that obey commutation of anti- 
commutation relations 



$(r,t)= [ (&<}L)e ik - r - iakt + tf(Me- ik - r+Uokt ). (9.65) 

J x/2(2n) 3 <,^ { 1 



These are the creation and annihilation operators for excitations of the field. For the 
Klein-Gordon equation they obey the bosonic commutation relations 

[d(W,d(k')] = [o t (k) ) d t (k')]=0 and [a^d^k')] = <5 3 (k-k'). (9.66) 

The state of the field can then be written as a superposition of Fock states. The field <p has 
now become an observable. 

In quantum field theory, the excitations of the field are interpreted as particles. All 
fundamental particles like quarks, electrons, photons, and the Higgs boson are excita- 
tions of a corresponding field. So the excitations of the Higgs field are Higgs bosons, and 
the excitations of the electromagnetic field are photons. Spin-| particles obey the Dirac 
equation 

{ihr^d^-mc)y = 0, (9.67) 

where the are 4x4 matrices 

H J)- H°, T) ■ -{I 7). H° t). — 
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and if/ is a four-dimensional vector field called a spinor field. The solution to the free Dirac 
field can be written as 



/r/k r 
; & g (k) a 8 (k) e*-^ + 3J(k) v s (k) e -**^' 



(9.69) 



where s = +|/z is the spin value, and u s (k) and u s (k) are two spinors carrying the spin 
component of the field 



Mk) = ^f heal ) and y g (k) = ^f (9.70) 
where is a normalisation constant, cr is a vector of Pauli matrices, and 

I+ = fi) and *- = [?]. (9.71) 



oj uj" 

The creation and annihilation operators 6 s (k) and dj(k) obey anti-commutation relations: 

{6 s (k),6 r (k')} = {6j(k),SJ(k')} = and |S s (k), S^k')} = <5 rs <S 3 (k-k'), 
{d s (k),d r (k')} = {dl(k),dj(k')} = and |d s (k),dj(k')} = <5 rs <5 3 (k-k'), (9.72) 
{6 s (k), 3 r (k')} = {6j(k), 2J(k')} = and |S s (k), 2J(k')} = |d s (k), £J(k')} = . 

This means that the excitations of the Dirac field are fermions with spin such as the 
electron. You see immediately that iff is not Hermitian due to the appearance of d\ This 
means that xfr is not an observable and we cannot think of the Dirac field as a quantised 
version of a classical field. There is no classical analog to the Dirac field. This is a con- 
sequence of the fact that the anti-particle of the Dirac excitations are not the same as 
the particle itself. E.g., the positron is different from the electron. Anti-particles are a 
quintessential^ quantum mechanical phenomenon. 

There is of course a lot more to quantum field theory than this. For example, the tech- 
niques for doing the perturbation expansion of interacting fields leads to Feynman dia- 
grams, and renormalisation theory must be employed to deal with the infinities that crop 
up in the perturbation theory. Furthermore, one has to choose the right Lagrangian den- 
sity, and principles such as gauge invariance and CPT invariance are imposed to constrain 
the possible choices. This leads ultimately to the extraordinary successful Standard Model 
of particle physics. It the most fundamental theory of Nature that we have, and it is tested 
to unprecedented accuracy. 
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Exercises 

1. The Hamiltonian for a two-level atom in the presence of an electromagnetic wave, 
as given in Eq. (9.38) depends on the time t. This makes it difficult to solve the 
Schrodinger equation, so in this exercise we will get rid of the time dependence by 
applying the Rotating Wave Approximation. 

(a) If \ f'(t)) = U(t)\y/(t)), find the Schrodinger equation for \ f'(t)). What is the 
new Hamiltonian? 

(b) Choose U(t) such that 

JT .._[e iwt ' 2 \ 

U(t) -{ e- iM, *y 

and H is given by Eq. (9.38). Show that the new Hamiltonian is given by 
Eq. (9.39). 

2. A two-level atom is placed in a perfect cavity with an electromagnetic field of fre- 
quency (D. 

(a) Show that the Jaynes-Cummings Hamiltonian can be written as a direct sum 
of 2 x 2 matrices H n , and specify H n . 

(b) Diagonalize H n to find the energy values of the system, and calculate the eigen- 
states. 

(c) At t = 0, the system is in the state |i//"(0)) = \e,n). Calculate the state \y/{t)). 

(d) Calculate the amount of entanglement between the atom and the cavity field. 
Use the relative entropy as the entanglement measure. 

(e) How long does the atom need to reside in the cavity in order to achieve maxi- 
mum entanglement? 



